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PREFACE 


With this volume, the sixth to appear under his editorship, Paul Mussen retires as 
editor to gain time for his many other activities. Mark R. Rosenzweig will continue 
as an editor, and Lyman W. Porter moves from the Editorial Committee to become 
the other editor. The Annua/ Review of Psychology has benefitted greatly and in 
many ways from Paul Mussen’s broad knowledge of both psychology and psycholo- 
gists and from his incisive editing. The next two volumes will continue to show his 
contributions since he helped to guide the most important task in preparing these 
volumes—the selection of significant topics and able authors. 

This volume also marks the end of the regular 5-year term of Professor W. D. 
Neff on the Editorial Committee. He has brought to the Committee his deep and 
wide-ranging knowledge and his dedication to the concept of the Annual Reviews. 
Lyman Porter’s assumption of an editorship opened another position on the Com- 
mittee. Conrad G. Mueller and Marian Radke-Yarrow have accepted the invitation 
of the Board to fill these positions for the next 5- and 6-year periods, respectively. 

Another important contributor to the continuity of the Annual Review of Psy- 
chology is Miss Jean Heavener, who became assistant editor in 1968. She partici- 
pates fully in every stage of the editorial enterprise, from the meetings of the 
Committee to the final corrections of page proof, and she responds cheerfully to the 
exigencies of a rapid production schedule and occasional strained deadlines. Chris- 
tina Smillie ably assisted her with some of the copy editing for this volume. 
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DEVELOPMENTAL - #210 
PSYCHOLOGY 


Donald M. Baer and John C. Wright? 


Department of Human Development, University of Kansas, Lawrence 


INTRODUCTION 


A selective review of current developmental psychology obviously will hinge on the 
meaning of developmental: what is selected should be that work most relevant to 
the essence of the field. Developmental psychology may have a unique problem in 
this regard, in that it may represent the one chapter in an Annual Review of 
Psychology which needs to question whether it exists as a reviewable field. The 
alternative, taken in its extreme form, would be to argue that developmental psy- 
chology is nothing more that all psychology in the small, that is, located in children 
and other young organisms; consequently, such studies might well be reviewed as 
special cases (the young cases) in other chapters. For the most part, developmental 
psychology has proceeded as if this alternative were untenable. Theories such as 
Freud’s and Piaget’s have postulated uniquely developmental processes occurring 
in young children, to some extent qualitatively different from the processes typical 
of adults, and centrally important in determining the shape of adult psychology. 
These theories and similar logics have generated a very large amount of research, 
and that reasearch has constituted a developmental psychology, even if it did not 
prove that it always deserved the name. Its failure as a widely convincing proof is 
evident in the existence of a number of learning theories of child development as 
exemplified by Sears, Dollard, and Miller, Staats, and Bijou and Baer. Such theories 
suggest that much of developmental phenomena represent behavior change accom- 
plished by the same environmental mechanisms that can and do produce behavior 
change in the adult organism. But these approaches, too, obviously fall short of 
widescale conviction of their audience, for research derived from the Piagetian 
paradigm clearly is increasing in relative frequency. And even if learning theories 
ordinarily do not cite different mechanisms of learning in the young organism from 
those in the adult, nevertheless they do typically point to a uniquely childlike 
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content of what is learned, and thus take on a tactical significance within psychology 
requiring more than isolated paragraphs in an Annual Review chapter on learning. 

In summary, the most critical research for review may well be those studies which 
suggest that there are uniquely developmental phenomena and those which suggest 
that there are not; however, whether or not there is a developmental psychology, 
there certainly is a child psychology. Even if the child is only a small case of the 
adult organism, he is an extremely common case (indeed, inevitable); the phenom- 
ena occurring in the child may not be different in kind from what will occur later, 
but they may very well have major import for what will happen later. They may 
be clearer in the child than in the adult; if there are troubles to be avoided or 
diminished for adults, the child may represent the best possible case in which to take 
preventive action. And, finally, the child is another case in which the generality of 
any principle may be tested—those laws that hold for adults but not for children 
are less general than those that hold for both. Thus a good deal of the research 
produced in this field is not aimed at a developmental psychology, but simply at a 
child psychology, and it also merits review. 

The best organizing variables of a child psychology are probably age and se- 
quence. Childhood is a process of change, and the change is very well organized (if 
not explained) by the child’s age. Observation suggests that a great deal of that 
change occurs in very common sequences, highly similar from child to child. Conse- 
quently, a considerable amount of research in this area consists of age comparisons 
or descriptions of sequences and the difficulty of finding or producing different 
sequences. A large amount of the basic description required for this organization 
is already in hand (students in the field: are regularly assigned the works of Gesell 
and his colleagues as the basic example and substance of the approach). However, 
as society changes, the situations within which children develop also change, and 
new descriptive research is required. 

If a large part of the justification for a child psychology is the inevitability of 
children, another large part must be their sentimental and strategic value to us. Our 
affection for the child hardly needs comment, but our strategic commitment to 
intervention in his life does. Adults live in a society which children cannot join and 
maintain without extensive behavioral preparation. Thus we socialize and school 
them, formally and informally, and consequently much of a child psychology must 
be an applied psychology, a conclusion exemplified by the most recent edition of the 
Review of Child Development Research (48). Yet application, often undertaken for 
no more theory-serving purpose than because children exist in large numbers and 
have troubles (or will have, without intervention), may eventually have a profound 
effect on the issue of a developmental psychology. Behavior modification approaches 
to applied child psychology, for example, have been pointedly nondevelopmental in 
their views and their procedures; nevertheless, they have reported some striking 
successes in changing certain behaviors. Conceivably such applications may con- 
tinue to demonstrate that a sufficient diversity of child behavior can be modified at 
such a range of ages and in such a wealth of sequences as to make a developmental 
psychology unimportant. This possibility has been suggested by Baer (10). 

The review that follows will be organized in part around the issue of the existence 
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of a developmental psychology, but in larger part according to the habitual topics 
of our existing child psychology, which only sometimes is concerned with whether 
it is a developmental psychology. Consequently, it will be seen that studies are 
grouped in honor of the fact that children perceive, learn, imitate, speak, symbolize, 
think, become socialized, and display personalities; and studies are chosen because 
they suggest that while all humans do these things, children may do them distinc- 
tively. 


DEVELOPMENTAL PSYCHOLOGY AS A DISCIPLINE 


The first topic of review 1s a sample of work especially relevant to the concept of 
a developmental psychology. Sometimes the relevance is explicit: it is research or 
commentary directly about the character that a developmental psychology has or 
should take on. Sometimes the relevance is implicit: the research or argument allows 
tentative conclusion that there are developmental issues per se; or, by contrast, 
suggests that what is called development (in the particular case at hand) may simply 
be a standard, nondevelopmental process which happens to be under study in a 
young organism. (Only a few examples of the studies that might be reviewed under 
this heading will be considered; most of the possible candidates instead are reviewed 
under some other substantive topic heading.) 

That developmental psychology was itself a developing ‘‘organism” seemed to be 
the view of Looft (249). His article considered three handbooks of child psychology 
published at various points in the last 40 years, together with a recent handbook on 
the psychology of aging. A variety of measures were derived from the comparison, 
both quantitative and qualitative, to demonstrate a developmental (“evolutionary”) 
process in the discipline. Looft suggested a three-part value in doing so: historical 
perspective avoids egocentrism; it sometimes shows that new ideas are in fact old 
ones; and, by demonstrating that the current state of the discipline is to some extent 
a function of its early history, it encourages understanding of the present (out of 
which understanding may come a new, perhaps better direction for the future?). 

Riegel (326), in elaboration on a similar theme, made a fraternal twin study of 
the development of the discipline. He described two parallel trends, defining in effect 
two subdisciplines, by distinguishing the “capitalistic” orientation typical of English 
and American psychology from the “mercantilistic-socialistic” orientation of Eu- 
ropean psychology. The significant formative figures of the former ranged from 
Hobbes to Hall, and of the latter from Rousseau to Piaget. The central theme of 
the capitalistic orientation seemed to consider development as a homogeneous and 
continuous process, thereby allowing comparison of all individuals to a single nor- 
mative standard (e.g. the intelligence test, as derived from white, achievement- 
oriented Americans). By contrast, the mercantilistic-socialistic orientation allowed 
and encouraged concurrent multiple models of growth; evaluation could well be 
specific to a generation or to a culture or subculture. Riegel’s description of the two 
trends is followed by a strong recommendation for modern developmental psy- 
chology, based upon an interaction model emphasizing both sociocultural contin- 
gencies and biological process, to incorporate both of the earlier trends into a new 
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integration. These recommendations require that a developmental psychology relate 
behavior not only to personal, individual time (age), but to historical time as well. 
Consequently, further recommendations result concerning the nature of research 
design in developmental psychology. 

A similar dissatisfaction with much current research design was voiced by Lewis 
(239, pp. 75-76) in the introduction to a (1971 SRCD) symposium on cross-cultural 
studies of mother-infant interaction. Lewis argued that the proper task of develop- 
mental research was to delineate psychological process. But variables such as cul- 
ture, sex, and social class are not psychological processes; instead, they are carner 
variables within which psychological processes are likely to be located. Unfortu- 
nately, he noted, there was a strong tendency in modern research merely to demon- 
strate carrier variable differences (between cultures, sexes, classes): “Any test or 
procedure that can accomplish this end is held in esteem, regardless of whether the 
results add any substantial datum to our understanding” (239, p. 75). His argument 
applies equally well to age differences as to culture, sex, and class differences. In the 
symposium which followed, there were several examples of research in the mode 
recommended. Goldberg (144), for instance, noted that Zambian infants differ from 
American infants in that Zambian infants spend a great deal of their time in a sling 
on their mother’s back. Goldberg theorized about the psychological processes al- 
tered by this practice and their consequent effect on motor, social, and cognitive 
development. She may not have proved her final point, but she had at least moved 
in the recommended direction. 

Finally, the notion that our concept of developmental psychology can be a self- 
fulfilling prophecy with grave social consequences was voiced twice in the past year. 
Tulkin (401) couched the point in terms of the concept of cultural deprivation. The 
concept itself, he argued, ignored cultural relativism, which Riegel also had pointed 
out was unappreciated in American psychology yet well represented in European 
orientations. To the extent that ‘‘deprivation” was taken literally, social scientists 
found themselves obliged to accomplish “enrichment” in reply, and this made them 
missionaries rather than scientists. In that the problems underlying the invocation 
of a cultural deprivation theme were (Tulkin argued) a result of certain features of 
the majority culture, this misdirected the scientist from the more pertinent task of 
analyzing, recommending, and designing the needed social changes within that 
majority culture. Furthermore, the concept of cultural deprivation was not (in 
Lewis’ terms) a psychological process but instead was a carrier variable and hence 
should be left behind in favor of more analytic studies. Those studies, very likely, 
would reinforce the idea of cultural relativism rather than deprivation. 

Eisenberg (104) made a similar point in terms of aggression. He argued that a 
developmental discipline which conceives of man as instinctively aggressive (espe- 
cially in the sense of inevitably aggressive) may not only be in error, but may in fact 
cause the error to approximate truth. In effect, Eisenberg argued for a developmen- 
tal psychology in which biological and environmental variables are in constant 
interaction; and the environmental variables certainly include sociocultural, politi- 
cal variables as very potent factors in determining human development. Thus a 
culture whose science believes that its members are inevitably aggressive is likely to 
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design itself in such a manner as to model, condone, and even reinforce the aggres- 
sion of its members, thereby “proving” to itself that it was correct. On the other 
hand, "men and women must believe that mankind can become fully human in order 
for our species to attain its humanity” (104, p. 124). Consequently, the striving for 
ethological parallels across species becomes a dangerous exercise, because the uni- 
formity of aggressive behavior across species is not accompanied by a uniformity 
of controlling mechanism. 


For example, attack behavior can be observed in organisms as varied as insect, bird, 
carnivore, ape, and man. In the first, it may be triggered by trace chemicals; in the second, 
by territorial defence, but only during the breeding season; in the third, by prey, but only 
if the appropriate internal state of arousal is present; in the fourth, by the appearance of 
a predator, if escape routes are unavailable and if the troop is threatened; and in man, 
by a mere verbal slur, if the social context and prior individual experience indicate attack 
as the socially appropriate response (104, p. 125). 


Pointing to a remarkable variety of human behavior patterns across different but 
viable cultures, Eisenberg called for a developmental psychology that assumed that 
human behavior was thoroughly malleable, and suggested that studies of child 
behavior change were the proper arena in which to see not if behavior was plastic, 
but rather how much behavior change a given organism could tolerate and remain 
an integrated psychic entity. The issue was perhaps survival. (A much more aca- 
demic but nonetheless pertinent parallel can be drawn from Eisenberg’s argument. 
If the science of a culture thinks of its members as inevitably aggressive, the culture 
may make it true. Then if the science of a culture believes that its members’ 
development is age scheduled, will the culture not make that true, too? And if the 
science of a culture believes that its members’ development occurs in fixed se- 
quences, can the culture also make that true?) 

Seven studies will be considered next which are less commentary or argument 
about the nature of developmental psychology than they are powerful examples of 
one view or another at work. 

To set the stage, consider a study by Brainerd (33), comparing age and stage 
(readiness) as factors in the training of a number conservation skill. Nonconserving 
kindergarteners were pretested; then they were either trained or not; and after a 
week of training (or waiting), they were posttested. As a group, the trained children 
improved in the conservation skill (relative to untrained children), but with consid- 
erable individual differences. A partial correlational analysis of those differences 
showed that the only variable predicting profit from training was the child’s stage, 
not his age, at the outset of training. The implication was that there are certain 
prerequisites to being taught a conservation skill; in other words, development does 
come in fixed sequences (but not at fixed ages). 

Next consider a study reported by Lane & Coon (231), dealing with reward-giving 
behavior by preschoolers. Children 4 and 5 years old performed a task, supposedly 
in collaboration with a partner; the partner’s work was seen to be either better than, 
the same as, or worse than theirs. The children then received a number of rewards 
for the task and were instructed to give their partner a share. Four-year-olds 
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generally took more than half for themselves, regardiess of the partner’s perfor- 
mance; by contrast, five-year-olds divided the rewards equally between themselves 
and the partner, again regardless of the partner’s performance. Adults, it was 
claimed, would have allocated the rewards on an equity basis: whoever did more 
would receive more. The implication again is that there is a sequence of change in 
how rewards are given away, and in this instance, the sequence was roughly predict- 
able by age. 

Campbell & Spear (49) also pointed to age and sequence in their review of the 
ontogeny of memory. Their summary of a large number of studies indicated clear 
evidence that long-term memory capacity increases markedly with development. In 
offering an accounting of that fact, the authors invoked both neurological and 
behavioral mechanisms either known or presumed to be scheduled along the path 
of normal development. 

The preceding three papers had much in common in terms of affirming (or 
assuming) that there is indeed a discipline of developmental psychology to be 
studied as such. Consider now four other studies of the past year which in this sense 
proceeded differently. 

Groen & Parkman (157) offered a simple model as an analysis of addition skills 
in children and adults. They assumed that any organism, young or old, possesses 
a counter. Presented with an addition problem of the form add m and a, the 
organism sets the counter to the larger of the two, increments it a number of times 
equal to the smaller of the two, and reads the result. Latency in producing the 
answer should then be a function of how many times the counter must be incre- 
mented, and both children and adults yielded latencies in addition which corre- 
sponded to this prediction. Adults, however, have very much shorter latencies than 
children, even though ordered according to the same pattern. It was postulated then 
that adults have an auxiliary memory-based “look-up” process which they usually 
use, but that they sometimes revert to the counter-incrementing technique charac- 
terizing the child. In this approach, there is clearly a developmental sequence from 
child to adult, based on a memory process. If the Campbell & Spear account of 
memory is taken, then to the extent that memory is a developmental, sequenced 
process, so is the transition from childlike addition to adult addition. But at the same 
time, the Groen & Parkman model postulated a fairly specific memory process. If 
that “look-up” behavior can be translated into a teachable set of skills, then it might 
be found that with appropriate techniques, the skills could be taught relatively early 
in childhood, allowing transition at any (teachable) age, but always in the same 
sequence! On the other hand, it might also be shown that the memory process a/one 
might be invoked sufficiently to give a child apparent addition skills in the absence 
of the counting mechanism. If so, then the counting mechanism might be added later 
on top of the memory technique (as schools do when they explain the concept of 
number line to a child who has already memorized a set of “addition tables”). 
Conceivably then, sequence would have been reversed, and a fast adder might 
become (for a time) a slow one, as he gave up memory for counter incrementing. 
Such a discussion is only speculative, of course. The Groen & Parkman model, if 
useful, simply illustrates the possibility that usual sequences need not prove to be 
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necessary sequences—although the belief that they are may well determine the 
structure of the schools’ curriculum in teaching addition. 

Bond (29) reviewed studies dealing with infant visual perception, looking for 
evidence about the infant’s capacity for form perception. This led her to consider 
visual discrimination behavior as well as visual scanning patterns. Despite her 
pursuit of the possibility, she could find no evidence requiring any belief in different 
infant perception processes than characterized the adult. She suggested that the 
qualitative equivalence of infant and adult perception was still the sound—as well 
as the prevailing—view. 

Similarly, Kramer, Koff & Luria (225) examined child and adult speech to assess 
competence in the use of a grammatical exception rule (the minimal distance princi- 
ple), and failed to find uniform competence anywhere between the ages of 8 and 20. 
The linguistic stage structure originally described for children below 8 years of age 
also characterized children and adults above age 8. Two years after the original 
assessment, those subjects who had not been competent in using the rule were 
retested. Improvement was found equally for all age levels within that group: “no 
evidence for greater improvement of competency for the younger ‘language-plastic’- 
aged Ss was found” (p. 121). Thus these authors, like Bond, found no distinctively 
developmental phenomena in the area of behavior they examined. And these areas 
seem no more delimited or trivial than those in which developmental phenomena 
are asserted. 

Finally, consider the report by Koegel & Covert (218) concerning the interaction 
between self-stimulatory behavior in three autistic children and their ability to learn 
discriminations under ordinary training procedures. In general, these authors found 
that when the children were engaged in self-stimulation (bizarre mannerisms, ata- 
visms, etc), straightforward reinforcement contingencies frequently failed to estab- 
lish a simple discrimination skill. However, if the self-stimulatory behavior was 
brought under control (by experimental punishment), the children then responded 
to those same contingencies and learned the discrimination. Furthermore, there 
were cases observed in which the reinforcement contingencies alone were sufficient 
to produce discrimination; in those cases, as the discrimination was learned, self- 
stimulatory behavior always was low. In other words, in these children, learning or 
its failure seemed to be under the direct control of the rate of another behavior. To 
the extent that their current environment failed to control that behavior, these 
children presumably would continue to fail under ordinary contingencies. If the 
interfering self-stimulatory behavior showed some developmental course of decline, 
then presumably positive development (learning the lessons presented by ordinary 
environmental contingencies) would begin. But self-stimulatory behavior, whatever 
its developmental course may be in such children, was demonstrably sensitive to 
environmental contingencies itself, specifically, punishment contingencies. For such 
children virtually any pattern of learning, or failure to learn, could be displayed, 
depending on what the environment chose to do about the children’s self-stimula- 
tory behavior. But as Koegel & Covert clearly demonstrate, the environment was 
in the hands of the children’s caretakers for these purposes. Those caretakers could 
program punishment, reinforcement, or nothing for self-stimulatory behavior, and 
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thereby would control what might otherwise appear to be a developmental process 
or a developmental anomaly. 

The work presented in this field sometimes touches directly on the issue of 
whether there is a field here to be called developmental psychology, rather than one 
better labeled child psychology. What is striking is the relative rarity with which 
either side of the issue is directly attacked. Perhaps the most salient conclusion to 
be drawn from a review of the past year’s research in this area is that it still has 
not demonstrated that it requires its chapter title, and it is not working very hard 
on doing so. 


Developmental Prediction 


The tradition of longitudinal study continues, and the most noteworthy publication 
was a study by Wilson & Harpring (426) of 261 pairs of twins from birth to 2 years, 
using the Bayley Scales. A quote from the abstract: 


The results showed that (a) with a small allowance for prematurity, the twins displayed 
essentially the same rate of progression in development as singletons; (b) the correlations 
between developmental scores obtained at successive ages were moderate and declined in 
size as the ages spread further apart; (c) at each age, however, the twins showed high 
within-pair correlations for mental and motor development, and they also displayed high 
concordance for the rate of gain across ages; (d) the correlations for monozygotic twins 
approached the reliability of the test itself, and most were significantly higher than the 
correlations for dizygotic twins; and (e) infant mental and motor development was tempo- 
rarily retarded by low birth weight, but was not affected by the socioeconomic status of 
the home. The conclusion from these results is that while prenatal anomalies or an 
impoverished home environment may retard development, in the majority of cases the 
environmental conditions fall within the limits of sufficiency that permit the genetic 
blueprint to determine the course of infant development. 


Another optimistic attempt at prediction has been launched, and a pilot study 
together with a promissory note has been published by Smith et al (369). A novel 
feature of this correlational study relating prenatal, perinatal, and postnatal events 
to developmental outcome at 7 years is the use of “sequential prediction stages” to 
limit the use of any predictor to that time interval beyond which its contribution 
to predictive accuracy is negligible. 


Miscellaneous Methodology 


An extremely heterogeneous group of studies are mentioned below because of their 
contribution of methodological innovations, cautions, or reassuring discoveries that 
certain variables traditionally a source of concern to research design needn’t be. 
With regard to spatial placement of the correct stimulus in two-choice discrimina- 
tion tasks, Moody & Gollin (287) have demonstrated the inadequacy of the hoary 
Gellerman and the newer Fellows series. Apparently children are ingenious enough 
to develop response patterns (hypotheses?) which will gain unintended confirmation 
in almost any left-right sequence. Therefore, instead of offering a better sequence, 
the authors wisely suggest four methods of identifying, anticipating, or individually 
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eliminating the systematically irrelevant patterning in children’s responses. These 
are not panaceas, but all are potentially helpful. 

Two of the “booming” areas in the field have produced seminovel methodological 
contributions: infancy and imitation. Infant head turning (67, 68) has soluble prob- 
lems as a conditioning task. A debate on errors in the corneal reflection method of 
determining infant visual fixation was launched in a paper by Slater & Findlay (364). 
The replies of the Yale group and the continuing debate will doubtless be discussed 
in next year’s Annual Review of Psychology. The continuing problem of locating 
drug-free neonates for research has led to increased attention to the effects of 
maternal perinatal pharmacology on neonatal behavior, and this effort has encoun- 
tered its own methodological difficulties (224). 

In the imitation arena, it has been demonstrated that an important determinant 
of both imitation and vicarious reinforcement effects is whether S knows during 
observation of M that he will later be expected to perform (394). Not surprisingly, 
the 20-year-old Asch technique for assessment of conformity has been applied to the 
study of imitation and perspective taking in the presence of peers (85, 417). 

Finally, a study of normal and retarded children’s P-A learning indicated fewest 
trials to criterion when items were presented by tape recorder; slower learning when 
they were presented by slides; and poorest learning when the venerable memory 
drum was used (214). 


PERCEPTUAL DEVELOPMENT 


In addition to the increasing number of studies of perceptual development bordering 
on the areas of learning and cognition discussed by subheads below, in 1972 there 
were studies of simple age changes in visual masking (160, 282); eidetic imagery 
(161); shape recognition under free vs sequentially self-programmed conditions 
(141); and hue, saturation, and brightness discrimination of colors (132). Two 
studies assessed auditory discrimination of speech sounds in early infancy (288, 
399); and two extended the literature on olfaction in newborns (338, 351). Anchor 
effects in weight discrimination were studied developmentally by Wilkening, Sarris 
& Heller (425) in Germany. 


illusions 


The temporal range within which apparent movement is perceived by preschool 
children declines with age when measured by a response-decrement model, but not 
when assessed by verbal report (246). The decline in strength of the Poggendorff 
illusion with age in relation to the fact that adults experience a successive-trials 
attenuation while children do not was discussed by Bayer (15). Pollack’s theory of 
the ontogenetic development of illusions was tested by Matheny (262), using the 
Ponzo illusion (a secondary one that should increase with age) with MZ twins who 
differed in susceptibility. In support of Pollack’s view, females, but not males, 
showed a within-twin correlation between susceptibility and nonverbal WISC sub- 
tests. Pollack fared less well in two other studies claiming to test his model. Hart- 
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mann and associates (175) failed to find correlations between the magnitude of the 
Müller-Lyer, allegedly a primary illusion, and age or IQ in children 6 to 9 years old. 
Brightness contrast and cognitive factors were examined in a study of the Delboef 
illusion by Weintraub & Cooper (418). who concluded that although the magnitude 
declined with increasing age, cognitive factors were implicated. Sjostrom & Pollack 
(363) criticized the adequacy and the assumption of equivalence of Weintraub & 
Cooper’s methods in rejoinder. 


Spatial Orientation 


A lively continuing interest in children’s perception with regard to orientation of 
objects, orientation of the body, localization with respect to the body, and spatial 
perspective is evidenced in the literature this year. McGurk (267) found that infants 
as young as 6 months of age (but not 3 months) can discriminate different orienta- 
tions of the same form and can also recognize their identity. Adults were compared 
to 3- and 5-year-olds in a study by Gajzago & Day (133) when, following discrimina- 
tion of a figure from its 180° rotation, the subject’s head was inverted and he was 
required to identify the previously reinforced alternative. Only 50-75 percent of 
adults showed uprightness constancy under these conditions and fewer children 
did so, especially when the figure represented a different object in its inverted 
form. 

Left-right discriminations were studied in the Piagetian context of development 
of logical relations by Harris (171), and horizontal vs vertical discrimination was 
analyzed by Mackay, Brazendale & Wilson (254). The latter found that the age of 
acquisition depended heavily upon task parameters and could develop in either 
order, depending upon the tests selected. Linear age effects and no sex differences 
in the development of directional concepts between ages 6 and 12 were found by 
Long & Looft (248), who ordered the various forms of directional competence in 
a sequence of development and confirmed previously obtained orders. Egocentric 
and nonegocentric comprehension of the spatial terms “front,” “back,” and “be- 
side” were studied by Harris & Strommen (172). 

Children’s changing comprehension of spatial relations is mediated by both social 
and cognitive factors, according to a theoretical model proposed by Fishbein, Lewis 
& Keiffer (119). Older children make fewer errors than younger children, but a 
higher percentage of those they make are egocentric errors. Correct responses and 
egocentric errors have equal latencies, both shorter than those for nonegocentric 
errors. The analysis is reminiscent of White’s temporal zones of automation and 
decision, respectively. Brodzinsky, Jackson & Overton (36) used Piaget’s classic 
mountain scene perspective task to demonstrate that nonegocentric performance 
increases with age from 6 to 8 to 10 years. Reasoning that salient perceptual features 
of the scene as viewed may interfere with the transformations required for the 
construction of anticipatory images, they visually shielded S from temporarily 
uninformative features of the array in order to reduce their perceptual impact, and 
found that shielding facilitated performance for 8- and some 10-year-olds, but not 
for 6-year-olds. 
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Perception and Representation of Human Faces and Figures 


The studies grouped under this heading use heterogeneous tasks, but have in com- 
mon an interest in the social significance of human figure perception and drawing. 
Harris & Fleer (170) used full faces and half faces in a recognition memory task, 
and showed that retardates require full-face stimuli for accurate responding, unlike 
their matched M.A. controls. Odom & Lemond (297) predicted a developmental lag 
between the perception and the production of facial expressions from kindergarten 
to fifth grade children, but found instead that both age groups discriminated more 
of the eight expressions used than they could produce, and the difference increased 
with age, though older children excelled at both tasks. 

Reasoning from a model based on language development, Goodnow & Friedman 
(149) designed a task in which children 3 to 6 years old were to complete a facial 
drawing presented to them as a circle containing two dots (eyes) placed either 
vertically on one side of the circle or horizontally but low in the circle. Resultant 
drawings could either follow figural cues (90° and 180° rotation, respectively) or 
could add details that made the face upright on the page in spite of the original 
display. The former solution was predominant, especially among the younger chil- 
dren. Analysis of this response as a qualitatively unique and developmentally inter- 
mediate pattern stressed the view that development in graphic representation, as in 
language, cannot be adequately described as successive approximation to adult 
behavior. 

The failure of psychodynamic factors to influence the size of human figure draw- 
ings by children was emphasized in a study by Black (26) in which age, actual height, 
and numerically self-estimated height accounted for most of the variability. Viewed 
developmentally, the construction of human figures in a three-dimensional medium 
follows an orderly developmental sequence, according to the findings of Golomb 
(148) with children aged 2 to 6 years. 


Cross-modal Perception 


Cross-modal perceptual generalization is an increasingly utilized task setting from 
which to evaluate what is perceived and at what level of informational generality. 
Using the auditory and visual modalities, Vlietstra & Wright (404) demonstrated 
that certain stimulus properties such as intensity and interruptedness, trained in 
preschoolers as a discrimination within a visual or auditory dimension (with the 
other modality irrelevant), provided a basis for transfer to the previously irrelevant 
modality, but only if the previously reinforced value of the transmodal property 
remained correct in the new modality. Visual feedback training in auditory localiza- 
tion brought younger preschool children up to the level of both trained and un- 
trained older children in auditory localization accuracy in a study by Warren & Judy 
(409). In a study of rhythm reproduction by 10- and 15-year-olds, Rileigh & Odom 
(327) showed that normal vs deficient hearing accounted for less variance than did 
salience of visually presented rhythm patterns in interaction with age. Beats were 
most salient for the 10-year-olds, followed by duration, and rhythm, which was most 
salient in the reproductions of 15-year-olds. Auditory-visual matches of pattern 
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were more difficult than intramodal matches, and sequential spatial matches were 
still more difficult in a study of pattern-matching in relation to reading ability by 
Bryden (41). Poor readers were inferior to good ones on all kinds of match, and they 
showed a high within-group correlation between intermodal matching accuracy and 
reading ability, unlike the good readers. When between-modality matching tasks 
and within-modality matching tasks begin with the same modality, there are no 
differences between them in matching accuracy, but those that begin with visual and 
transfer to either auditory or visual are easier than those that begin with auditory 
and transfer to either modality, according to a study by Kuhiman & Wolking (228). 
A number of studies considered transfer between the visual and tactual kines- 
thetic modalities. Fry & Craven (130) compared visual to active and passive tactual 
discrimination of horizontal and vertical discriminations. Jones (194) compared 
blind and sighted children on visual, cutaneous, and kinesthetic localization tasks. 
Rose, Blank & Bridger (331) found no differences in visual-visual, visual-tactual, 
tactual-tactual, and tactual-visual matching performances by 3-year-olds. Delays 
between presentation and matching, however, produced deficits for all conditions 
involving a tactual component, suggesting perhaps the greater potency of visual 
imagery in short-term memory. Even 6-month-old infants performing a Piaget-type, 
permanent-object assessment task, demonstrate the relative potency of visual over 
tactual cues, as demonstrated in an ingenious experiment by Gratch (151). 


Attention: Stimulus Factors 


This section is concerned with absolute stimulus features as determinants of selec- 
tive attending, features leading to general statements about salience and preference, 
more or less regardless of the long- and short-term preceding experiences of the 
child. Color vs form preferences (with size as a third option) were investigated in 
first graders during three sessions spaced over 6 months in a study by Offenbach, 
Baecher & White (298). Form was consistently preferred to color, which was 
preferred to size, and of those Ss who changed dominance patterns between sessions, 
about as many shifted from form to color as from color to form. Smiley (367) 
produced a convincing demonstration that intradimensional variance influences 
preference for color or form. Although no main effect of age (kindergarten vs first 
grade) appeared in initial preferences, making the cues on either dimension more 
similar induced preference shifts to the other dimension in a second test. 
Normai and distorted faces and figures as three-dimensional stimuli were pre- 
sented to 1-, 2-, and 3-year-olds in Boston and Yucatan by Finley, Kagan & Layne 
(118). One-year-olds looked longest at normal stimuli and 3-year-olds attended 
longest to the most distorted figures. Neither sex nor culture differences were 
observed. Bartol & Pielstick (12) found an age-by-sex interaction in a study of 
looking time at ambiguous and unambiguous figures. The former elicited more 
attention in general, especially among sixth graders, as compared with second 
graders and adults, with males reaching their peak attention to ambiguity earlier 
than females. Dots, stripes, and checkerboards at three levels of complexity were 
shown to infants aged 6 and 11 weeks by Greenberg & O’Donneil (152). In all cases 
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the 11-week-olds spent more time fixating the complex patterns and less time on the 
simple patterns than did the 6-week-olds. 

McGurk (268) showed that preschoolers readily discriminated between stimuli 
differing only in orientation, but when color and size were covaried with orientation, 
the latter came out a poor third. Kessen, Salapatek & Haith (210) explain on the 
basis of scanning patterns their finding that vertical contours attract the newborn’s 
visual attention, while horizontal ones appear to have no effect whatever. A proce- 
dure for independently assessing attention-getting vs attention-holding properties of 
checkerboard designs was devised for infants by Cohen (71). He demonstrated that 
latency of turning toward the pattern (getting) was more determined by overall size 
of the stimulus, while duration of fixation (holding) was more determined by the 
number of squares it contained. Jones-Molfese (199) compared full-term and prema- 
ture neonates in their looking times at planometric vs stereometric checkerboard 
patterns presented in pairs. The two samples did not differ on planometric patterns 
varying in amount of contour, but they did differ on stereometric patterns varying 
in contour, texture, and shading. 


Attention: Habituation and Novelty 


The general discrepancy hypothesis as cited by Super et al (390) states that attention 
to a stimulus bears an inverted U-shaped relationship to the discrepancy between 
the actual stimulus and the person’s schema for it. Their data support the hypothesis 
by demonstrating such a function describing infants’ attention to a standard labora- 
tory mobile as a function of its discrepancy from different mobiles to which the 
infants had been exposed daily for the preceding 3 weeks at home. A similar function 
approximately describes the relationship between the temporal and spatial un- 
predictability of sequential light flash patterns and the suppression of limb move- 
ments and sucking (components of the orienting response) in 4-month-old infants, 
but not in 2-month-olds, whose orienting responses were generally weaker (74). 
Response decrement to repeated exposure and recovery to stimulus change were 
demonstrated with checkerboards in neonates by Friedman (128). The same dis- 
habituation to novel stimuli was demonstrated with upright faces in 5- and 6-month- 
old infants, especially females, by. Fagan (109). The same faces presented less 
realistically or inverted were less well discriminated. 

In a test of Jeffrey’s serial habituation hypothesis for the formation of schemas, 
Miller (281) first determined (by looking times) the initial saliency of three compo- 
nents of a standard stimulus separately. She then presented her 4-month-old male 
Ss with the standard repeatedly to produce habituation, following which the compo- 
nents were again presented separately. The dishabituation data suggested that 
habituation of the components had been occurring in the order of their initial 
saliency. Other theories, of course, might make the same prediction, but the design 
is ingenious and very worthy of replication and extension. Similar support for 
Jeffrey’s hypothesis might be drawn from a study of infant cardiac responses to 
two-tone auditory sequences (184). Dishabituation (response recovery to novel 
stimuli) was obtained only when the first tone in the sequence was changed, and only 
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then if sufficient habituation had occurred prior to presentation of the stimulus 
change. Berg (22) also demonstrated habituation and dishabituation of cardiac 
responses to auditory stimuli. Unusually rapid habituation was obtained, along with 
smaller orienting responses to offset of continuous stimuli. 

Three studies explored novelty-familiarity preferences in more naturalistic set- 
tings. Twenty infants were studied repeatedly from 6 to 12 months of age by 
Schaffer, Greenwood & Parry (344). Familiarization with an intially strange object 
was followed by presentation of an incongruous object in each session, and visual 
responsiveness indicated within-session habituation and dishabituation. No effect of 
familiarity and congruousness on touching latency was observed until 8 months, 
after which prolonged hesitation in touching unfamiliar stimuli appeared. Parry 
(307) also studied visual fixation and manipulative latency to the same stimuli in 
familiar (home) and unfamiliar (lab) settings, finding more visual attention and 
higher latency of approach to novel stimuli in the familiar setting. Ross, Rheingold 
& Eckerman (336) demonstrated that 12-month-old infants entered the more novel 
of two rooms upon leaving their mothers, and played with the more novel of two 
equally available toys. 

Finally, Siebold (356) demonstrated that fifth- and sixth-grade children prefer 
novel to familiar but meaningless stimuli (Welsh figures) as a function of degree of 
familiarization, regardless of the complexity of the stimuli or the length of delay 
between familiarization and rating (up to 7 days). 


Attention: Patterns of Exploration 


This rather heterogenous group of attention studies includes patterns and sequences 
of attending, scanning, or exploratory behavior that appear strongly to influence 
perception. Pick, Christy & Frankel (310) compared second and sixth graders’ 
reaction times as they compared selected aspects of pairs of objects, either being 
informed of the relevant aspect before or after presentation. While the older children 
had faster reaction times, the difference between the times for the informed and the 
uninformed conditions was greater for the older than for the younger children. 
Deliberate selective attention thus becomes more discriminating and efficient with 
age. The spatial separation of manipulandum and locus of reinforcement was stud- 
ied by Millar & Schaffer (279) in 6-, 9-, and 12-month-old infants using operant 
conditioning and a noncontingent pseudoconditioning control. All Ss learned under 
zero and 5° displacements, but only the older two groups learned when the displace- 
ment was 60°. 

Twins between 5 and 11 years of age were given a wide set of arrays to explore 
by Matheny (263). Although exploration became more systematic and less error- 
prone with age, no left-right patterns were observed for arrays with strong Gestalt 
properties, contrary to the previous finding of Elkind and Weiss. Left-to-right 
scanning has been implicated in the bowed serial position curve for recall of strings 
of letters presented tachistoscopically (left best; right next; middle worst). This 
finding is reversed for readers of Hebrew. In a study by Keenan (205), children from 
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grades two, four, and six who read both Hebrew and English showed left primacy 
for recall of strings of English letters and right primacy when the letters were 
Hebrew. 

Two experiments by Jones (195) illustrate the value of active, isomorphic move- 
ments in psychophysical judgements by school children. When Ss moved their arms 
through a length, it was more accurately reproduced than when either their arm was 
moved passively or when no movement accompanied initial exposure. The effects 
were replicated in a second experiment with six-sided random forms. The results 
increase the credibility of feedback effects frequently reported informally by 
Zaporozhets. 

One last study, an investigation of direction of serial processing of geometric 
stimuli, though hardly conclusive, may lead to interesting replications and exten- 
sions. Braine (32) studied recognition of figures as a function of vertical orientation 
(right-side-up vs upside-down). Stimuli were designed with a strong vertical axis so 
that a feature defined as focal for young children was always at one end and a 
distinguishing cue for recognition at the other end. Error analysis indicated that 
younger children start with the focal feature and proceed downward, regardless of 
its location, while older children and adults start at the top and process downward 
regardless of the location of focal features. 

What is needed for this and many other types of studies of perceptual develop- 
ment is a practical, high-speed eye-movement recorder that uses a spare field of a 
Gerbrands-type tachistoscope without interfering with its other function. Designers 
note: the camera need only be activated during the few seconds before, during, and 
after each tachistoscopic exposure, and the child’s head could be free to move at all 
other times. 


LEARNING 


A very considerable amount of research effort is still devoted to the demonstration 
that what is known about learning in general (usually derived from the animal 
laboratory) is also characteristic of children’s learning. To some extent, this is 
open-minded research, asking a simple generality question: whether children fit the 
same principles as the usual laboratory subjects. However, such research can have 
profound implications for developmental theory, in that a learning analysis can be 
a nondevelopmental candidate to replace the more traditional developmental analy- 
sis (e.g. 10). Thus it is significant to note that new and increasingly complex behav- 
iors of children have been brought under the control of environmental contingencies 
exemplifying both positive and negative reinforcement; that basic mechanisms of 
learning such as conditioned reinforcement have their existence in children’s behav- 
ior; that familiar effects such as behavioral contrast can be seen even in infants; that 
different types of reinforcers have differential effectiveness, sometimes predictable 
by other characteristics of the child; and, inevitably, that many of the issues 
still debated within learning theory can also be contested in the arena of child be- 
havior. 
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A New Method 


This section of the review might well start with a bit of methodological ingenuity. 
In the operant analysis of learning (indeed, of behavior in general) the concept of 
steady state performance is of central importance (cf. 355, chap. 8). Steady state 
refers to an experimental situation within which the behavior being analyzed exists 
in a stable yet so delicate an interaction with the experimental setting that it is 
maximally sensitive to the widest variety of experimental variables applied to it; yet 
when these variables are discontinued, the behavior returns to its characteristic 
baseline, ready for further analysis. Several methods have been developed for main- 
taining samples of children’s behavior in approximations to this experimental ideal 
(cf. 25); however, the collection could hardly be considered complete or even effi- 
cient. Thus a technique developed by DeCasper & Zeiler (87) is more than welcome. 
These experimenters secured agreement to develop special “locks” for the school 
lockers of young children. Access to the locker (to store clothing, obtain necessary 
school materials, etc) was accomplished only by responding on a manipulandum 
which turned on lights (displayed on the locker) according to one or another 
reinforcement schedule. When ten lights had been turned on, the locker opened. 
DeCasper & Zeiler were able to maintain steady, repeated performances in their 
subjects, using light onset as a reinforcer eventually signaling an open locker, 
presumably because access to the locker was an important, frequently repeated event 
in their school routines. In particular, the method showed that children would 
display the high-rate performance characteristic of fixed ratio schedules, as well as 
the low-rate performance characteristic of differential-reinforcement-of-low-rate 
schedules, but they did not at first show the temporal discrimination which fixed 
interval schedules ordinarily produce in laboratory animals. (This lack of temporal 
discrimination under the fixed interval schedule has been frequently noted by puz- 
zled researchers pursuing operant analyses of child behavior). However, subsequent 
to sufficient experience with the differential-reinforcement-of-low-rate schedule, the 
children did display the low rate appropriate to fixed interval performance, thus 
demonstrating at least one experimental addition to fixed interval scheduling impor- - 
tant to its generality of effect across species. 


New Responses 


Sucking by the neonate has been subjected to learning analyses often in recent years 
and with fruitful and meaningful results. In general, it appears to have the character- 
istics of an operant response. Brown (39) has shown that it also conforms to the 
Premack principle, in that she has demonstrated that contingent access to a typically 
high-rate sucking response can reinforce a usually low-rate sucking response, 
whereas contingent access to the low-rate sucking response will punish the high-rate 
response. Brown noted that non-nutritive sucking on a regular nipple is higher than 
(and preferred to) sucking on a blunt nipple. She then programmed 30-sec access 
to one nipple as a consequence of finishing a fixed ration of sucking on the other, 
and obtained the reinforcement and punishment effects just noted in the form of 
increased or decreased rates of sucking, depending on the consequence. 
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By contrast, Hillman & Bruner (180) analyzed the effects of reinforcement sched- 
ules on nutritive sucking in infants between 1 and 4 months of age. Their technique 
controlled milk delivery through the nipple according to a range of fixed interval 
schedules (maximum interval, 2 sec) and fixed ratio schedules (maximum ratio, four 
sucks). Sparse schedules (ratio and interval alike) produced shorter sucking bursts 
and longer pauses between bursts than richer schedules, an effect seen more clearly 
in the older infants than the younger ones. Arthythmic sucking in response to sparse 
reinforcement schedules was more characteristic of the younger infants. Since paus- 
ing in sucking is often correlated with increased scanning activity, the authors 
postulated increased alertness and information processing resulting from the sparse 
schedules. 

Replication of a now familiar condition effect, reinforcement of vocal response in 
infants, was reported by Ramey, Hieger & Klisz (319). Their study involved two 
normal and two “failure-to-thrive” infants, the latter characterized by retarded 
growth (third percentile or lower for height and weight) without known organic 
basis, coupled with apathy or other developmental lags and some evidence of 
inadequate mothering. Successful conditioning, as seen in longer durations and 
higher rates of vocalizations, was shown across the subjects, suggesting generality 
of the operant conditioning process despite such presumably important individual 
differences, 

Aggression in boys in the form of hitting a Bobo doll was related to punishment 
schedules by Parke & Deur (304). Punishment was offered in the form of a loud buzz 
contingent on hitting, on either a continuous (consistent) or intermittent (inconsis- 
tent) schedule, preceded by instructions that either equated the buzz to “playing 
badly” or simply to being a "bad noise” (the latter presumably avoiding the implica- 
tions of failure confounded with the former). Previous findings with similar tech- 
niques were confirmed: consistent punishment was more effective than inconsistent; 
either form of stimulation was more effective than no stimulation (an extinction 
schedule for hitting the doll); and—-not previously established—these effects were 
similar under either type of preceding instruction, thus extending the generality of 
the effect from the restricted sphere of failure effects. 

Negative reinforcement of self-criticism by children in a game-playing situation 
was demonstrated by Grusec & Ezrin (159). After noting baseline levels of self- 
criticism by the children, the experimenters at first punished the children by either 
withdrawal of love or of material rewards. Self-criticism levels did not change simply 
as a result of punishment (which was not contingent on self-criticism), but they 
increased in those children for whom termination of the punishment was pro- 
grammed contingent on self-criticism (in contrast to others for whom punishment 
was terminated noncontingently). Children reinforced for self-criticism appeared 
less disturbed by “errors” than children not so reinforced, suggesting the behavior 
had more than a trivial function for the children. In those children whose self- 
criticism was strengthened through negative reinforcement (escape from punish- 
ment), increased use of self-criticism was seen in subsequent session when the 
children played alone (i.e. in the absence of a punishing agent). 

Discriminative control of attention span during task performance was shown in 
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a study by Redd (322). He offered two retardates three tasks each, one task always 
associated with reinforcement for correct responding, another with noncontingent 
reinforcement, and the third with no reinforcement at all. In the course of training, 
the task associated with reinforcement for correct responding came to maintain 
steady task behavior; the other two did not. This contro] was shown in novel settings 
and with novel experimenters. Given a choice among the three tasks, the children 
chose either the contingently reinforced or the noncontingently reinforced tasks 
over the nonreinforced task, showing no preference between the first two. Thus it 
was seen that through an appropriate history of reinforcement, attention span could 
be maintained by the task itself, to some degree independently of the preference 
value of the task. (That is, contingently reinforced and noncontingently reinforced 
tasks were preferred equally, but only the contingently reinforced tasks came to 
control attentive, ongoing performance). 

Finally, it may be noted that children’s creativity, in one definitional form or 
another, became the subject of reinforcement. Goetz & Baer (143) studied the 
blockbuilding activities of three preschoolers, defined a list of block forms, and 
applied descriptive social reinforcement to the production of every new form dis- 
played in each session’s construction. (In other words, only the first appearance of 
any form was reinforced in each session; subsequent productions of the same form 
in that session were ignored.) The number of different forms made in each construc- 
tion proved responsive to the reinforcement contingencies; furthermore, the emer- 
gence of new forms, never seen before in any prior constructions by the child, also 
increased during these reinforcement conditions. The combination of increased 
diversity of forms plus the emergence of new forms was postulated as a useful 
definition of “creative” blockbuilding and as a socially desirable outcome. 

Rather different measures were involved in another study in which Stratton & 
Brown (387) reported that grade school children could be trained for both quality 
and quantity of the solutions they produced in a series of problem-solving tasks (a 
combination labeled “creative thinking” by the authors). Prior studies had devel- 
oped one or the other of these dimensions, neither of which represents a desirable 
outcome if it is at the expense of the other. The present study showed the relative 
ease of programming for both aspects of problem solving in a factorial design 
combining production training, judgment training, both in combination, and nei- 
ther. A behavioral mode of problem solving was suggested from the results. 

Still another approach to creativity training was exemplified in a study which 
reported that children’s ideational creativity was modifiable by reinforcement condi- 
tions (particularly by giving a penny for every successive idea generated in response 
to a task). Ward, Kogan & Pankove (408) noted that the penny was the most 
effective of several incentive techniques used, and that their current sample of low 
SES children yielded results highly similar to the middle class samples involved in 
earlier research. They also pointed out that under the uniform treatment conditions 
of their study, those children whose ideational creativity was increased maintained 
the same rank-ordering they had shown prior to the increase; from this finding, the 
authors argued for the “greater importance of capacity than of motivational vari- 
ables” (408, p. 699). Their results would have allowed them to argue differently, 
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however, if ideational creativity can be increased by reinforcement, for then applica- 
tion of much reinforcement to certain children and little to others should change 
their rank ordering. The next experimenter meeting the same children and applying 
uniform reinforcement to them would probably note that all increased but in the 
same rank order. However, that rank order would represent not capacity but the 
prior experimenter’s differential reinforcement programs. 


Basic Mechanisms 


Classical conditioning of children has not been as frequently pursued as instrumen- 
tal conditioning in recent years, perhaps because the mechanism is considered 
reasonably well known and because those responses susceptible to it are not consid- 
ered as important to child development as are instrumental behaviors. Nevertheless, 
Werden & Ross (421) used eyelid conditioning in preschoolers to produce data of 
developmental significance. Their study utilized procedures of trace and delay con- 
ditioning of eyelid responses to pure tones; both paradigms were successful, but the 
delay procedure more so. Comparison of these data to those produced in earlier 
studies with retardates and college students showed similarities and differences in 
each direction, allowing the conclusion that developmental level (presumably the 
variable at issue among the three populations) was important in determining the 
effectiveness of a basic learning mechanism. What remains to be seen, of course, is 
whether these differences are relatively invariant or whether they can be erased (or 
even reversed) by theoretically minor adjustments of the conditioning procedures 
for each population (e.g. a change in UCS, or schedule, or interstimulus interval). 
Depending on how minor the adjustments could be, such demonstrations would 
tend to diminish the significance of developmental level differences. 

Delay in reinforcement is usually considered to cause a decrement in learning, 
perhaps a failure of learning, relative to the effects of immediate reinforcement. 
There is a literature confirming this generalization for children and also a small body 
of studies indicating special cases in which delay has no (or even beneficial) effects 
on children’s learning. To these Goldstein & Siegel (146) add another case: with a 
two-choice successive visual discrimination problem, they showed that forced pre- 
exposure of the discriminative stimuli during the delay and interstimulus intervals 
of the paradigm obviated the usual decrement produced by delayed reinforcement. 

Millar (278) also offered a study of delayed reinforcement in young infants (4-8 
months), using zero-, 1-, 2-, and 3-sec delays between a hand pull and “perceptual 
reinforcement.” In general, a sequence of experiments appeared to show that the 
longer the delay, the less likely was successful conditioning; however, with extended 
experience and improved procedural controls, this gradient was nearly erased. 
Unfortunately, considering the response used by Millar, it is not clear whether to 
classify the study as one of delayed reinforcement or as one conditioning extended 
duration of the hand pull; and considering certain of his procedures, it might be 
considered a pause-shaping study. In the former case, the differences would proba- 
bly be considered less informative of the basic nature of infant cognition than of the 
need to program successively longer response durations, a needed operative in any 
shaping study in any organism. 
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Conditioned reinforcement has been a mainstay concept in all learning ap- 
proaches to child development, and it is common in such approaches for their 
treatment of socialization, language, and personality to hinge strongly, if not exclu- 
sively, on the process. Nevertheless, in the animal laboratory (where presumably the 
effect should be conditionable with powerful variables, such as food reinforcement 
in deprived animals), the effect is known as fragile and difficult to replicate. Perhaps 
the most robust displays of the phenomenon are in chain schedules, in which the 
conditioned reinforcer is never entirely divorced from the unconditioned reinforce- 
ment on which it depends in its initial shaping. (In such arrangements, the condi- 
tioned reinforcer is not literally paired with the unconditioned reinforcement, but 
instead serves as a discriminative stimulus for often extensive response performances 
which eventually gain the unconditioned reinforcer.) Thus a demonstration of con- 
ditioned reinforcement in children outside of the chain schedule arrangement is 
always significant. Just such a demonstration was offered by Favell & Favell (111), 
who placed children in a match-to-sample situation involving compound stimuli, 
which could be matched on the basis of either form or color. Form and color 
matches were reinforced by a token system equally often. One initially neutral 
stimulus was paired with each reinforcement delivery, and a second initially neutral 
stimulus was given at nonreinforcement times. The test of conditioned reinforce- 
ment consisted of programming the paired stimulus contingent on every match to 
a given dimension (e.g. form) and the nonpaired stimulus contingent on every match 
to the other dimension (e.g. color). Three of five children showed clear conditioned 
reinforcement control, preferring to match the paired-stimulus dimension, ac- 
comodating promptly to reversals of the dimension receiving the conditioned rein- 
forcement and reversals of the roles of the initially paired and nonpaired stimuli. 
The remaining two children, however, showed only dimension and stimulus prefer- 
ences, unaffected by the conditioning contingencies. Thus the Favells’ study again 
demonstrated the existence of the conditioned reinforcement phenomenon, but also 
displayed its failure to produce uniform results outside of the chain schedule. 

Nevertheless, the phenomenon has also been demonstrated in infants (an ex- 
tremely high priority research need for learning approaches to infant socialization). 
Silverstein (361), working in Lipsitt’s laboratory, taught 10-month-old infants a 
spatial discrimination on the basis of a conditioned reinforcer established within a 
single experimental session. First, tone was paired with cereal presentations as 
reinforcement for touching a target; after each successful touch, there was an 
intertrial interval during which a different tone was heard. Subsequently, the infants 
were presented with two targets simultaneously, and heard the first tone as a 
consequence of touching one and the second tone as a consequence of touching the 
other. The previously cereal-paired tone produced more responses to its target than 
did the tone previously paired with the intertrial interval, thus displaying an appar- 
ent conditioned reinforcement effect, and in a very straightforward, reality relevant 
situation. There remains to be seen, of course, the reliability of this effect across 
infants, and the development of techniques which could give it durability and 
generality. 

Conditioned reinforcement within the chain schedule paradigm is generally a 
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reliable phenomenon, and its apparent analogue in behavior modification, the token 
reinforcement system, is very widely used with typical success. (The token system 
may be considered an analogue of the chain schedule in that in the token system 
one set of responses is reinforced by tokens, which then serve as discriminative 
stimuli for another set of responses—trading them in—which produces already 
effective reinforcers.) Nevertheless, remarkably little analytic research has been 
reported on token systems, i.e. research establishing how/why the systems work. 
Thus a study by Smith (370) requires note, even though it is a study of a previously 
Operating system rather than a currently operating one. Smith exposed two groups 
of children to a two-button discrimination task, reinforcing one group with tokens 
previously paired with trinkets and the other group with tokens not previously 
paired with anything, for pressing one of the two buttons. Subsequently, both groups 
underwent extinction, receiving nothing for pressing either button. During the final 
extinction condition, the group that had received unpaired tokens pressed the two 
buttons equally, but the group which had received previously paired tokens for 
pressing one button continued to press that button in preference to the other. The 
study, although not directly representative of most practically useful token systems, 
nevertheless does control for a number of variables typically confounded in token 
system applications. 

Watson & Ramey (414) offered a speculative and novel suggestion about condi- 
tioned reinforcement in the young human infant on the basis of some unusual results 
in an infant conditioning study. In effect they suggested that a nonsocial reinforcer 
which successfully takes control of some instrumental behavior of the infant will 
subsequently be responded to as if it were a “social” stimulus. In that so much of 
the infant’s behavior is coming under the control of social reinforcers, the general- 
ized classing of all reinforcers as social reinforcers may well be a reasonable, if 
temporary, response. The idea clearly deserves further definition and research atten- 
tion. 

In general, the year produced a fair number of successful demonstrations of the 
conditioned reinforcement effect, most of which were treated by their authors as 
straightforward conditioning phenomena. Libb (243), however, reported success in 
showing acceleration of a candy-reinforced response by stimuli paired directly with 
candy presentation, but failed to find any increase in rate in the presence of a cue 
signaling that such conditioned reinforcers were available as response consequences. 
He interpreted that pattern of success of the conditioned reinforcer and failure of 
its cue as evidence for a frustration interpretation rather than a conditioned rein- 
forcement interpretation (a not uncommon alternative view). Nevertheless, it must 
be remembered that his paradigm was attempting to develop discriminative function 
for a stimulus on the basis of a newly, perhaps weakly conditioned reinforcer; higher 
order conditioning with weak stimuli is always a difficult feat. 

Behavioral contrast plays no particular theoretical role in current approaches to 
child development, but it is a familiar observation in learning; hence, it is reasonable 
to ask for demonstrations of it in children’s learning in the interests of evaluating 
its generality as a phenomenon. In general, contrast is seen when behavior is weak- 
ened in one situation and consequently increases in another, despite the fact that 
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the increased rate shown in the second setting will not gain any more reinforcement 
than was secured by the earlier rate. Previously no demonstrations of this effect had 
been noted for children (apart from one report of a brief, transient effect). In the 
past year, however, sustained contrast has been reported twice, once for children 
and once for infants. Waite & Osborne (405) presented children with alternating 
schedules of variable interval reinforcement, and observed the usual differential 
rates of response (the schedule with the shorter average interval yielding the higher 
rate of response). They then replaced one of the variable interval schedules with an 
extinction schedule, and noted a sustained rise in rate in the remaining variable 
interval schedule during subsequent alternations. This rise in rate could not earn 
more reinforcers for the children, yet it persisted, allowing a conclusion of behav- 
ioral contrast (a conclusion which is a label, not an explanation). 

Working with neonates, Kobre & Lipsitt (217) used similarly alternating sched- 
ules of reinforcement with an experimental group, in this case, 5-min periods of 
sucking for a water solution alternating with 5-min periods of sucking for sucrose. 
Rather than alter the schedule for these infants, the experimenters estimated what 
their water-sucking rate would have been in the absence of the preferred sucrose 
schedule by observing a second group sucking on water exclusively. The water- 
sucking rate of the second group was higher than the water-sucking rate of the group 
alternating between water and sucrose, allowing a conclusion of negative behavioral 
contrast. The authors took this as evidence that “memorial processes are not absent 
in the neonatal child” (p. 81), a conclusion pertinent to recent hypotheses about very 
short memory contingencies in newborns (412). 


Motivational Vartables and Setting Effects 


A number of studies continue to show that the operation of basic learning mecha- 
nisms can be affected by variables other than those directly involved in the condi- 
tioning contingencies. Babad (9) reported a further detail in the nature of the social 
deprivation/satiation procedures shown by Gewirtz & Baer (138, 139) to affect 
social reinforcement. In this study, children were exposed to a 10-min treatment 
period during which they received either 2 or 16 presentations of approval from the 
experimenter. Subsequently, the reinforcing effectiveness of approval was tested in 
a discrimination task, either by the same experimenter present during the treatment 
period or by a novel one. The 2-approvals children showed greater responsiveness 
to approval than the 16-approvals children only if the experimenter dispensing 
approval in the test was the same person who had given it during the treatment 
period. This was interpreted to mean that the children developed expectations about 
the experimenter present during treatment which did not generalize to the novel ' 
experimenter. Alternatively, it might be said (descriptively) that one human’s ap- 
proval is a different stimulus than another’s, and thus each is subject to its own 
deprivation/satiation procedures (analogous to the way in which we may satiate for, 
say, chocolate ice cream but not for vanilla). 

Working on the same problem, but from a somewhat different theoretical perspec- 
tive, Warren & Cairns (411) showed that young children displayed a satiation effect 
(reduced response for a contingent social reinforcer subsequent to a high rate of 
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presentation of that reinforcer) only if the satiation treatment was conducted in a 
manner which did not give the social reinforcer discriminative status for a specific 
task aimed at reinforcement. When, by contrast, satiation was attempted with 
contingent, discriminative use of approval (“right”) as a guide to correct perfor- 
mance of a specific task, its effectiveness as a reinforcer was enhanced, not decreased. 
The study is open to a variety of interesting interpretations, one of which may be 
made within the same reference as the original hypothesis of social deprivation/ 
satiation effects: satiation is a process which may be overlaid with opposite-function 
conditioning processes, such that as a stimulus is subjected to satiation procedures, 
it is simultaneously subjected to additional conditioning procedures (e.g. pairings) 
which will enhance its effectiveness. 

A study by Kubose (227), relevant to “boredom” as a motivational setting effect, 
is also reminiscent of the satiation concept. Kubose required children to look at 
either colored squares or pictures for either 8 or 18 sec prior to a simple response 
task. Whether the following response was to be instrumental in securing reinforce- 
ment or merely obedient to instructions, it was faster following 18-sec exposures 
than following 8-sec exposures (of either colored squares or pictures), suggesting 
that lengthy periods of invariant stimulation have motivating (“boring”) function. 

Experiences of success or failure have received repetitive attention recently as 
setting events affecting subsequent performance, especially self-reinforcement prac- 
tices. Masters & Peskay (261) compared black and white children from lower and 
upper SES backgrounds in their self-reinforcement rates, after success, failure, and 
neutral feedback in a game. Self-reinforcement was allowed in two forms, contingent 
(according to the child’s opinion of how well he did) and noncontingent (as much 
as he wanted, independently of how well he did). In general, it was found that 
children from low SES levels, children experiencing success, and children allowed 
noncontingent self-reinforcement showed higher rates of self-reinforcement. In ad- 
dition, black children were found to use self-reinforcement slightly more than white 
children did; but whereas white children used noncontingent self-reinforcement 
after both success and failure experiences, black children showed it primarily after 
success. Such findings hardly clarify the nature of socioeconomic status or race as 
variables in the learning processes, but they do invite analysis, serving presumably 
as the “carrier” variables cited by Lewis in a discussion of cross-cultural variables 
(239). In another, closely related study, Masters (260) showed that children experi- 
encing success or failure on an initial task, as well as greater or less rewards than 
other children had been seen to receive, typically showed high rates of self-reinforce- 
ment on a later task after success, whether it was allowed in contingent or noncon- 
tingent form; and they showed high rates of self-reinforcement after failure only in 
its noncontingent form, or only on tasks quite different than the first one. 

Deutsch & Stein (95) also found that failure on a task was effective setting event, 
this time in the realm of “private speech” (i.e. speech not addressed to anyone in 
particular or requiring a response from anyone). Preschoolers who failed a task 
showed higher rates and more articulate forms of private speech than children who 
were interrupted in the task (not through failure but because of inadequate materi- 
als) or children who performed the task successfully. It remains to be demonstrated 
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what function private speech may have for any other behaviors of children; the 
authors did note a positive relationship to MA. 

One of the most basic qualifiers of the effect of any learning contingency is the 
effectiveness of the reinforcer involved in that contingency. In the past year, several 
studies have aimed at the clarification of such reinforcer differences. Blair (27) used 
a size discrimination task with middle-class grade-school boys, offering either tangi- 
ble, social, or performance (correctness) feedback as reinforcement. With boys who 
could be classified as normal achievers, social and performance reinforcers were 
most effective; with low achievers, tangible reinforcers were most effective. In a 
similar vein, Pawlicki (308) compared the effectiveness of contingent and noncontin- 
gent supportive comments, neutral comments, or silence in maintaining a simple 
marble-sorting task. Increases over prereinforcement baseline rates were obtained 
only in the case of contingent supportive comments, although subsequent compari- 
sons of the reinforcers yielded no continuing differences. In this case, the differential 
effectiveness of the stimuli offered was presumably struggling to surface through a 
heavy overcast of obedience to instructions, which is probably not the optimal 
situation in which to compare reinforcers. Moffat, by contrast, may have obviated 
such instructional complications by using an avoidance situation to compare the 
value of two types of recorded music (285). His subjects listened to records and 
could respond to avoid a regularly scheduled interruption of that music. The music 
was rated as having high “interest value” for one group, low “interest value” for 
another. The group listening to high interest value music acquired steady avoidance 
responding better than the other group; but no differences were seen between the 
groups in the subsequent extinction of the avoidance response—an interesting dis- 
parity. Haugan & McIntire (176) offered a rare comparison of three reinforcers 
suitable for shaping infant vocalizations by evaluating food, tactile stimulation, and 
imitation of the infant’s vocalizations by the experimenter. All proved effective, but 
imitation of the infant was clearly the most effective stimulus of the trio in increasing 
rate of vocalization. Such a finding immediately invokes the possibility that the 
function of the stimulus may have included evoking repeated subsequent vocaliza- 
tions as well as (or even instead of) reinforcing prior vocalizations. The authors point 
out that vocal imitation may have been more effective because of its relative immedi- 
acy of delivery and greater variability of content, as well as the possibility that their 
subjects—institutionalized 3- to 6-month-olds—were somewhat deprived of such 
stimulation. 


Discrimination Learning and Transfer 


The bulk of publications in this area of relevance to developmentalists is still 
concerned with whether and how younger children perceive, organize the task, 
mediate, are attentionally biased, and therefore transfer discrimination training in 
ways that are qualitatively different from those utilized by older children. Flavell’s 
production deficiency and Reese’s mediational deficiency hypotheses have been 
appropriately regrouped and discussed developmentally by Kendler (208). She con- 


DEVELOPMENTAL PSYCHOLOGY 25 


siders both as demonstrable forms of mediational deficiency, and accordingly re- 
defines the latter as contro/ deficiency, arguing that both are involved in the 
transition from sensorimotor to symbolic-representational levels of functioning. 

Along with increased attention to discrimination skills in younger children has 
occurred a corresponding increase in the number of studies focussing on perceptual 
and attentional determinants of children’s learning, including stimulus salience, 
dimensional dominance, and attentional training (37, 147, 235, 296, 366, 367, 388). 
In contrast to the generally positive findings of these studies, three studies that 
compared verbal and visual techniques directly found some advantages for the 
former (206, 207, 436). Fein (112) found kindergartners less sensitive to distinctive- 
ness and verbal instructions than third graders in a transposition task, and Reese 
(324) found that ADC helped and AEC hurt discrimination in ways not attributable 
to rehearsal or perceptual differentiation. Distinctiveness of cues may thus be better 
viewed as a consequence than as an antecedent of dimensional and attentional 
processes in discrimination. 

Reversal, ID and ED shifting continue to be studied in their own right, and 
optional reversal shifting appears to be a linear function of log age (209). It is not 
particularly stable over one week, but for that half of the kindergarten and second- 
grade children studied by Brier & Jacobs (35) who were stable, it was positively 
related to IQ. The use of half reversals or partial dimensional reversals was studied 
by Miller (283) and by Trinder et al (400). The trend toward finding surprising 
competence in younger and younger children is exemplified in an excellent study 
combining elements of oddity learning, transposition, and optional shifting by 
Brown & Scott (38). The authors conclude that their results are “inconsistent with 
theories of learning which posit a developmental progression from single-unit pro- 
cesses to complex mediational processes, unless that progression can be said to occur 
before 3 years of age” (Abstract). 

Color or form dominance has again been shown to be modifiable by either 
differential reinforcement (FR 5 vs FR 2) or by instructions to verbalize in 4- to 
7-year-olds (162). Better recall of previously discriminated positive than negative or 
neutral cues has been demonstrated in a design controlling differential frequency 
and recency of exposure (323), and the result is stable over three consecutive 
discrimination problems. Incidental recall of a redundant, relevant cue is superior 
to recall of an irrelevant cue following discrimination, and a recency explanation 
is offered by Anderson (4). 

Various kinds of stimulus summation and compounding received attention in 
1972 in multiple-cue and training-generalization studies. Simultaneous use of multi- 
ple cues with different assigned weights was demonstrated in a psychophysical 
judgment task for school-age children by Montanelli (286). Spiker’s stimulus in- 
teraction hypothesis was further supported in a study by Berch (21). The range of 
stimuli used in stimulus generalization studies appears to directly affect discrimina- 
bility of stimuli within that range, which in turn determines generalization gradi- 
ents, according to Croll (80). Formal predictions for multidimensional 
generalization as a function of verbal pretraining on one dimension were derived and 
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sustained in a study by Spiker, Croll & Miller (378). Bogartz (28) provided an 
alternative analysis of the same data using three cue-selection models which he 
claimed explained the data of Spiker et al equally well. Their reply (379) illustrates 
what may be the problems symptomatic of a field.in which the number of theoretical 
formulations exceed the number of significant and reliably demonstrable phenom- 
ena. 

Task, format, and other boundary conditions nay be more potent than theoreti- 
cally subtle differences among processes and mecLanisms in determining the amount 
of transfer of discrimination training according to the results of four experiments 
reported by Campione & Beaton (50). 


Concept Formation: A Strained Concept 


Classıcal concept formation studies have almost disappeared from the literature, but 
the learning of conceptual classes, rules, and sequential-probabilistic patterns by 
children have all received research attention in 1¢72. Caruso & Resnick (53) exem- 
plified a trend toward logical and sequential task analysis in place of abstract 
theorizing. Having logically defined an “optimal” sequence for three matrix classifi- 
cation tasks (simple to complex) they demonstrated most rapid learning and best 
transfer to new matrices with similar structure Zollowing “optimally” sequenced 
training. The Bower-Trabasso procedure was usec to test for continuity vs. noncon- 
tinuity in concept acquisition in an excellent study by Tighe & Tighe (397). The 
procedure involves presolution reversals of reinforcement contingencies and should 
increase errors to criterion (as compared with nc-reversal controls) if the child is 
continuously developing partial knowledge abort the solution, but not if he is 
engaged in relevant cue selection, as are college-age subjects. Fifth graders showed 
no such interference attributable to presolution reversals, but kindergarteners and 
first graders did. Cue-selection, rather than logical hypothesis generation seemed to 
account for the effect. 

Verbal processes continue to be implicated as determinants of the more elaborated 
concepts of older children (342). It is noteworthy, however, that the involvement 
and contribution of verbal behaviors to the formetion of class concepts defined as 
perceptual sets in preschoolers (422) and toddlers (292) is beginning to merit consid- 
eration. Verbalization by a model facilitates vicarious rule learning, generalization, 
and retention (439). Modeling and instructions to imitate were more effective than 
rehearsal and initial probability in a rule-governed verbal association game (333). 
Modeling and rule provision were effective in the absence of extrinsic reinforcers of 
any kind in a dial-reading, numerical concept Las: (441). 

Bogartz’s attentional-mnemonic model of binary prediction received confirmation 
by Jones (198), and cue-utilization and achievemert in one-and two-cue probabilis- 
tic tasks were studied developmentally by Deffenbacher & Hamm (88). The latter 
found that sensitivity to environmental regularity was high at all ages, that Weir’s 
U-shaped function relating age and achievement cbtains only when the task pro- 
vides one partially informative cue, and that when two cues of different probabilistic 
validity were present, performance increased linearly with age. 
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Memory 


The study of mnemonic capacities and processes in children has shown a remarkable 
surge in popularity, perhaps reflecting recent increases in concern with attention, 
information processing, and memory in adults; advances in studies of children’s 
verbal mediation and imagery; concern with developmental psycholinguistics; and 
Piaget’s recent concern with mnemonic processes. A discussion of memory as an 
epiphenomenon of various cognitive developments and social interactions was pub- 
lished by Meacham (273). In general it appears that most studies are concerned 
either with a variety of contextual effects or with various measurable, trainable, or 
otherwise inducible strategies for processing information in ways that make it more 
retrievable, and these two foci are used to summarize the literature below. 

A few memory studies are noteworthy but do not fit in the two categories men- 
tioned. Retention by preschoolers was studied by Rosner (335) in a task similar to 
the game of “concentration,” where over successive tests, items are drawn from the 
same pool, placed face down in a row, one at a time, and then S is given a copy 
of one of them and must locate its copy by position in the face-down array. Experi- 
enced Ss show primacy effects (like adults) when a new list is used. Repeated testing 
from a small pool reduces retention regardless of serial position, and, after the first 
test, shows no primacy effect with children. Analogously, in paired-associates (P-A) 
learning Cole & Kanak (73) demonstrated that while first and third graders learn 
slower than fifth and seventh graders, there are no age differences in recall, and R-S 
recall was as good as S-R recall at all ages. Short-term recognition time for words 
was corrected by subtracting simple naming time from decision time, the difference 
being termed “memory scanning time” by Kirsner (215), who found no age differ- 
ences in memory scanning time by this procedure when presentation list and test 
word were given auditorily. There were developmental differences, however, when 
the recognition list and the test word were presented in different modalities. Thus 
apparent age changes in mnemonic latency may be more a result of perceptual and 
response factors than of developmental changes in memory scanning rate. Reinstate- 
ment of previously learned behavior was demonstrated in 7-year-olds by Hoving & 
Choi (186) when they presented the previously learned response, but not when they 
presented the previous stimulus. Hoving et al (187) also demonstrated reinstatement 
by repeating pairs in a story context, but found no differences between 5- to 8-year- 
olds and 8- to 11-years-old. 

Initial and middle positions of the serial recall curve are more strongly dependent 
upon age and IQ than are terminal positions (110) suggesting that short-term 
memory is less influenced by development than long-term memory. Similarly Bel- 
mont (19) showed that acquisition and retrieval are strongly influenced by age and 
IQ, but that short-term forgetting rate is not. 

Nonverbal context can clearly serve to organize and facilitate recall, especially in 
young children. Reese (325) demonstrated that acquisition of paired-associate pic- 
tures was facilitated by showing the stimulus and response figures in mutual interac- 
tion as compared to showing them merely side by side. Repetition of responses 
facilitated primacy position recall as well. A facial context within which various 
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curved lines could be interpreted as smiles and frowns did not facilitate discrimina- 
tion and recognition in a study by Pick et al (£11). For children “too young to 
mediate verbally” (3- and 4-year-olds) Conrad (75) found that color and form 
differences were equally well recognized from short-term memory, and that recogni- 
tion was improved when items were redundantlr differentiated in both color and 
form. 

Five studies this year were concerned in one way or another with words vs 
pictures as stimuli in children’s memory tasks. Davies (86) showed 9-year-olds 
either names or pictures of objects. Pictures were more effective when Ss had to 
identify the object immediately in an array, bit words were more effective in 
subsequent written recall. Holyoak, Hogeterp & Yuille (182) found that the P-A 
learning and recall of interacting figures was superior to the same pair presented side 
by side (cf Reese 325), and that simultaneous ZE-*erbalization of the names of the 
paired items connected by a verb was better than their connection by a conjunction 
for both acquisition and recall. Again there were no age differences. Rowe (339) 
showed that in a discrimination learning task pictures were more easily discrimi- 
nated than words by both adults and fourth greders, regardless of whether the 
required response was verbal or nonverbal. Using 5-, 8-, and 12-year-olds, Rand- 
hawa (320) compared visual vs verbal input and verbal vs nonverbal output. Visual 
input and nonverbal (reconstruction) recall was the best combination; verbal input 
and verbal output was the worst. Only output moce interacted with age, with older 
children doing better on verbal. Perhaps the most comprehensive study on this topic 
is that by Lynch & Rohwer (252). Pictures as stimuli were better recognized and 
matched than words, and the difference was greate- for sixth than for third graders. 
Sixth graders were also better at verbal recognition of pictorial stimuli. When E 
provided at exposure the names of stimuli, the grace-by-mode interaction vanished. 
In summary, it appears that pictorial beats verbal at all ages, and that relevant, 
meaningful verbalization at exposure, when added to pictorial presentation, may 
facilitate recall, especially when the recall task invclves verbal stimuli or responses. 

Within the domain of verbal memory, two studizs demonstrated that both rapid 
and long-term recognition of words is facilitated when encoding makes use of the 
evaluative (good-bad) connotations of the words (57, 117). Meaningfulness of both 
stimuli and responses is positively associated with P-A retention (142). Free recall 
is superior and more clustering occurs with semantically related than with unrelated 
words, which in turn are recalled better than acoustically related words by third 
graders (443). Horvitz & Levin (185) found that maintaining semantic constancy 
of the sentence or phrase context in which verbal paired associates were presented 
to that in which they were tested facilitated recall. Imagery instructions facilitated 
performance of sixth but not third graders. In free recall first graders may cluster 
more than adults instead of Jess (94). If the relationship between words is one of 
functional complementarity, children cluster more. but if it is one of categorical 
similarity, then adults cluster more. The finding eppears to be analogous to the 
syntagmatic-paradigmatic shift in word associations associated with increasing age. 
The same phenomenon may underlie a finding by Suzuki (391) that children (but 
not adults) learn noun pairs better in the syntactic context, noun-verb-noun-con- 
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junction-pronoun than in the context, noun-conjunction-noun-verb-pronoun. That 
is, “dog bites man and me” creates a functional, syntagmatic relationship between 
the to-be-remembered items “dog” and “man,” which is mnemonically helpful to 
children. However, “dog and man bite me” gives a similarity or paradigmatic link 
between the to-be-remembered items that is less useful to children. Similarly Ehri 
(103) found that deep grammatical structure relationships, but not surface structure 
ones, facilitated retrieval of noun pairs, and Thurlow & Turnure (396) found that 
elaborative contexts, especially semantic paragraphs, facilitated P-A acquisition and 
reversal. 

A multibowed serial recall curve for lists of nine digits was produced by experi- 
menter-imposed spatial and temporal groupings during presentation in a study by 
Harris & Burke (169) of elementary school children. Corsini (76) found that spatial- 
positional cues were not used for memory purposes by kindergarteners, but verbal 
and other perceptual’cues were helpful. Heider & Olivier (177) studied color naming 
and color matching from memory in two cultures whose color names and categories 
differed markedly. Contrary to Wharfian expectations, memory matching was un- 
affected by color-name semantics, but, of course, language and culture differences 
showed up in the naming task. Deaf and hearing children were compared when 
using and not using vocal labels as an aid to recalling object arrangements by 
Chovan (61). Differences in MA rather than deafness or use of labels accounted for 
most of the variance in recall. Similarly, Kimball & Dale (212) found that although 
recognition for colors was related to ability to name them, again CA, MA, and IQ 
strongly interacted with both naming and recall. 

A final group of memory studies has as a common feature the elicitation or 
observation of specific mnemonic strategies, as distinct from those which merely 
provided mnemonic aids. Marx (258) found that younger elementary children 
showed better recall, but older children showed better recognition for words whose 
spelling had been practiced 3 months earlier. He interpreted the interaction as a 
consequence of greater verbal elaboration by the older children. Mallory (256) 
compared kindergarten and second graders classed as verbalizers with those classed 
as visualizers in a P-A task where verbal, visual, or no elaboration was provided. 
Isomodal advantages of selective elaboration strategies were demonstrated, which 
generalized to no-elaboration items. Instructions to remember, to group phoneti- 
cally vs semantically, to categorize, or to organize all have been demonstrated to 
facilitate recall (24, 78, 216, 265, 350). However, kindergarten and first-grade Ss, 
unlike fifth graders, study no differently and recall no better when instructed to 
memorize than when instructed merely to look at the stimuli (5). Using labial vs 
nonlabial phonemes and labial EMG recordings with second and third graders, 
Goldstein, Locke & Fehr (145) found no evidence that subvocalizers excelled non- 
subvocalizers in free oral recall of pictures and words. Instructed “visual rehearsal” 
helps preschoolers remember (277), and spontaneous manual manipulation of ob- 
jects to form “dynamic images” facilitates recall even in “preimagery” children 
(429, 430). Kindergarteners benefit from instructions to form images only when the 
objects are tactually available for manipulation, whet 4 they are visually 
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The most convincing studies in general appear wo be those which define a stimulus 
feature, task poverty, or ancillary behavior, manipulate and measure it directly, and 
then assess its effects on memory—all as a functien of a carrier variable like age or 
intelligence. These development-by-treatment d2signs are particularly revealing 
when a contributory behavior has been identifed correlationally as facilitating 
learning or memory in naturalistic settings at developmental level o, and can then 
be experimentally synthesized and separately observed as a potential facilitator at 
level n - k, 


IMITATION 


Imitation by children is a longstanding target of research, in part because of its 
importance in socialization and language development, in part because it is a whole- 
sale method of behavior change, and no doubt because it is a complex behavior and 
thus appropriate to competing theoretical analyses. The past year’s research on 
imitation can be organized into three main categor:es: studies of variables contribut- 
ing to the development or modification of imitation as a behavior; comparison of 
imitation and direct reinforcement or punishment; and the use of imitation to 
develop other behaviors. 


The Development and Control of Imitation 


Schroeder & Baer (348) used operant techniques to teach two retarded children 
increasingly skillful vocal imitation by two differen’ methods of direct shaping; they 
then compared the generalization of imitative skill resulting from each method. One 
method labeled “serial” consisted of teaching the child to imitate three words or 
phrases, each taught separately, until a criterion of mastery was achieved, then the 
next, etc. The other method called “concurrent” imvolved teaching three words or 
phrases intermixed, bringing all three to a criterion of mastery at roughly the same 
time. Each subject experienced both methods in repeated alternations. Interspersed 
tests of generalized imitative skill (with untrained words and phrases) showed 
consistent superiority of generalization resulting from each application of the con- 
current method. Although the concurrent method is not the usual method of teach- 
ing language to deficient children, the authors suggested that it might prove to be 
advantageous for such skills. Replication is clearly in order, especially with both 
smaller and larger units of speech, and with generalizable skills other than vocal 
imitation. 

Geshuri (137) added another detail to the literacure concerning the facilitative 
effects on imitation of observed reinforcement of the model. He showed young 
children video strips of a model displaying certain behaviors; over repeated trials 
the model displayed either a steady rate of these responses or an increasing rate, and 
was seen to receive either approving comments, neutral comments, or no special 
stimuli. (An unusual feature of the study was a preliminary phase, during which the 
reinforcing function of the approving comments to de used in the video strips was 
evaluated for the subjects.) Only those children watching the model receive approv- 
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ing comments showed imitation of the model, matching the rate (constant or in- 
creasing) displayed by the model over trials. Thus it was concluded that observed 
reward of a model is a cue for matching the model. 

Two other studies uncovered techniques for inhibiting imitative behavior. Jeffrey, 
Hartmann & Gelfand (190) had children watch a model make choices in a simple 
preference task. Some children were reinforced directly for matching the model’s 
responses; some instead experienced a nurturant interaction with the model; and 
some had neither experience. Those children reinforced for doing so imitated the 
model; those who had just had a nurturant interaction with the model matched no 
more often than a chance level; and those who had experienced neither (i.e. who 
had encountered an unresponsive model immediately prior) showed negative imita- 
tion—they systematically mismatched the model’s preferences. In a subsequent 
phase with the model absent, only the previously reinforced children continued to 
match; the others showed chance performances. Negative imitation, or systematic 
mismatching, is a little researched phenomenon, but its logical importance would 
seem equal to that of positive imitation. The method of this study might well prove 
fruitful for future investigations in greater detail. 

Examining a different class of variables, Masters (259) also found a negative 
imitation effect. In his study, preschoolers interacted with an adult partner; different 
groups discovered that the partner received more, the same, or fewer rewards than 
they did. Subsequently, the adult modeled certain behaviors of an innocuous sort, 
and also gave away rewards to supposed “poor children.” Then the children had 
their chance to imitate his behaviors, including the giving away of rewards to other 
children. The unusual finding was that children who had seen the adult receive fewer 
reinforcers than they had received imitated the model less than children in other 
conditions. Why this should be awaits analysis. Meanwhile, Masters recommended 
that a proper definition of altruism should require a demonstration that the child 
called altruistic give away more rewards than a model has been observed to do, to 
distinguish it from mere “imitated altruism.” 


Comparison with Direct Reinforcement and Punishment 


Cheyne (59) compared direct experience of reward and punishment with imitation 
(vicarious reinforcement). In that study, young children either chose a word from 
a pair of words and were told directly whether the choice was right or wrong, or 
else they watched a model undergo the same experience. In a subsequent test of 
recall (of which word in each pair was correct), directly rewarded and imitating 
(observing) children did equally well in recalling correct items; but children who 
were directly punished recalled better those negative items than children who only 
observed those items punished. However, if a child were given explicit instructions 
about future matching during the observation phase, this difference was eliminated. 
Such studies help to emphasize that there are probably few, if any, invariant differ- 
ences between learning by imitation and by direct contingencies: either can be done 
well or badly; thus comparisons can be made to show any outcome. 
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Use of Imitation to Affect Other Behaviors 


In a simple and straightforward application of imitation and reinforcement tech- 
niques, Zimmerman & Pike (440) increased the cuestion-asking behavior of disad- 
vantaged Mexican-American second grade children. A comparison to the effects of 
praise alone, contingent on question-asking, showed that it was effective, but not so 
efficient as the combination of an adult model with the praise. An untreated control 
group showed no essential change in their rate of the same behavior over that span 
of time. Some generalization of the effect was seen with a new teacher who neither 
modeled nor praised question-asking. In addition to the experimental group-control 
group design, a multiple baseline design was usec as well, marking one of the first 
appearances of this “behavior modification” design in the more abstractly oriented 
journals of the field. It may be presumed that question-asking eventually should be 
maintained by the intermittent reinforcement implicit in the answers the behavior 
produces; an interesting question for generalization theory is how to use techniques 
such as in the present study to bring the behavior into sufficient contact with those 
natural consequences for indefinite maintenance of the response class. The Zimmer- 
man & Pike study at least suggests a strong start toward that goal. 

The intersection of imitation phenomena and television phenomena are apparent 
in a study by Wolf & Cheyne (428), in which schcol boys were exposed to models 
who either conformed to or broke a rule not to perform a certain act. One modeling 
condition consisted of live peers behaviorally brezking or conforming to the rule; 
another was the televised versions of the same acts and a third model consisted of 
a peer verbally describing breaking or conforming to the rule. Relative to a no-model 
control condition, the children generally did wha: their models did; the live and 
televised models proved most effective, and the verbal model least. Ominously, the 
effects of rule-breaking models were more apparen: in a one-month later followup 
than were the effects of rule-conforming models. “he study adds one more set of 
facts to the accumulating literature on the effect of television models currently under 
debate. 


LANGUAGE 


The obvious importance of language to child development need not be labored here. 
However, its role as the arena within which competing theoretical approaches to the 
study of development come into their clearest opposition might well be noted once 
again. In particular, as learning approaches have demonstrated increasing ability to 
theoretically and factually analyze more and more areas of child behavior, language 
has increasingly become the crucial test case. Language may be seen as the expres- 
sive, mediating handmaiden of the essentially cognitive, symbolic, internalizing 
child; alternatively, it may be seen as a remarkably complex repertoire of discrimi- 
nated operants with special utility for mediating generalization and class organiza- 
tion of behaviors in general. At the very least, it will be recognized as uniquely 
human in its full blown form, and as a very large problem for any approach. In terms 
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of this context, the past year was surprisingly quiet (although some of the work 
which might be summarized here will be considered instead under Applications). 
Three classes of research will be considered: a few examples of the conditions which 
may be arranged to produce one or another language skill; some consideration of 
the mediating function of language for the development of other behaviors; and 
some contributions toward the theory of language acquisition. 


Control of Language Skills 


The undifferentiated vocalization rate of 3- to 4-month-old infants again proved 
susceptible to social reinforcement in a study by Banikiotes, Montgomery & Baniki- 
otes (11). The social reinforcement was vocal feedback by adults. Conditioning was 
very quick, occurring in two 3-min stages of reinforcement within a single session. 
A notable component of the study was the use of both male and female experiment- 
ers to supply vocal reinforcement: no differences were seen in their effectiveness as 
reinforcing agents, nor was any shift in the fundamental frequencies of the infants’ 
voices noted as function of male vs female reinforcement. (There is some evidence 
to suggest that with older infants, the use of male or female voices as reinforcers 
could produce a shift in fundamental frequency toward matching that of the experi- 
menter. Whether this study represents a fundamental age difference, or simply too 
brief a conditioning period to allow the shift, is not clear.) 

By contrast, Whitehurst, Novak & Zorn (424) reported an experimental case 
study in which speech was improved by evocative rather than reinforcing contingen- 
cies. Their study began as an observational analysis within the home of a 3 year-old 
speech-retarded child, noting minimal levels of imitative prompts and conversation 
by the parents to the child. Experimental intervention consisted of prescribing 
relatively smail increases in the parents’ rates of using either conversation or imita- 
tive prompts with the child. Either tactic proved equally successful in accelerating 
the child’s acquisition and steady use of new words, as evident in dinner table speech 
(when all recordings were made). The study is noteworthy for its results, its careful 
observation and quantification techniques, and its use of a natural setting and the 
natural events within that setting. In addition, its interpretation is clearer than that 
of many other natura] setting researches, in that it could specify that the immediate 
controlling variable was evocative conversation or prompts rather than an increased 
use of social reinforcement (which was observed and noted as constant throughout 
the experimental phases of the study). The authors point out that laboratory analy- 
ses of language skills need not prove relevant to natural environment mechanisms 
of language development, and express a concern for questions “... that can only 
be answered by studying the development of language in situations where parents 
and children can interact in a normal manner, where variables are defined from the 
situation rather than imposed upon it” (424, p. 169). There remains, of course, the 
need for explication of why these natural parent behaviors had such potent evocative 
function for their child’s speech. Conceivably these events rely upon the reinforcing 
consequences parents have programmed intermittently throughout the child’s ear- 
lier history. If so, a final analysis may yet prove to resemble laboratory models quite 
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closely. Nevertheless, there is no denying the validity or value of the authors’ 
approach. It represents a rare intersection of psycholinguistic and behavior modifi- 
cation techniques (cf. 10). 

Bennett & Ling (20) reported a simple study of language improvement with a 
noteworthy outcome. Their subject was a 3-year-old hearing-impaired girl whose 
speech was primarily telegraphic, lacking articles, auxiliary verbs, etc. Through 
experimentally controlled imitation techniques, she was taught to use “the” and “is” 
in her descriptions of pictures presented to her. In consequence of this short, simple 
training, she was then able to describe new, umtrained pictures in well-formed 
sentences in the present progressive tense. In tha telegraphic speech impresses its 
hearers as primitive and retarded, whereas wel.-formed sentences are taken as 
evidence of good symbolic development, the stucy represents a useful analysis of 
how simple the response analysis of an apparently complex step in development may 
be for a given child. 

Language acquisition continues to be studied as an imitative process, with empha- 
sis on the naturalistic language environment (127 398) and the verbal behavior of 
the mother (183, 371). In laboratory settings, imitation and modeling have in- 
fluenced grammatical rule learning and sentence complexity more strongly than has 
exposure, expansion, sex, and bilingualism (173, 257). Appropriate, novel, two-word 
responses by 2-year-olds can result from training involving imitation, differential 
reinforcement, and holding the first word constant Whitehurst (423) describes this 
procedure as a “partially conditioned frame.” Acults as interpreters of children’s 
speech are more accurate if they are parents. (4191. According to Silverman (360), 
the complexity of children’s speech as judged by adults is not influenced by the 
context (egocentric vs social), but Moerk (284) ıas made a strong plea for the 
incorporation of more contextual nonverbal varizbles in the study of adult-child 
interaction and demonstrated the use of this strategy. One study of receptive lan- 
guage has important methodological implications. Kamil & Rudegeair (202) found 
much higher sound discrimination scores for 5- and 6-year-olds when either (2) 
stimuli were made more redundant, e.g. /bob/vs/dod/ instead of /b/vs/d/, or (b) 
multiple testing sessions were used to establish a “amiliar procedure and a strong 
discrimination set. In general, in this area there appears to be a healthy level of 
interaction and cumulativity of research between theoretical camps, between labora- 
tory and naturalistic studies, and between correla-ional surveys and experimental 
interventions. 


Mediation of Other Behaviors 


Language, used as a tool of conceptual analysis, was modeled for young children 
in a study by Rosenthal, Alford & Rasp (332). The subjects (second graders) were 
unable to solve a novel clustering task presented to them. They then watched a 
model solve examples of the problem, with different groups experiencing a silent 
model or models providing varying degrees of clarity of explanation, to the point 
of supplying a rule to handle future instances. In general, the degree to which the 
children acquired the clustering concept at issue was a function of the clarity or 
strength of the model’s verbal coding during the mcdeled performance, as was their 
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generalization to new stimuli embodying the same concept. Thus, whereas imitation 
can be a potent developmental mechanism, such studies as these demonstrate clearly 
that there exist performances too complex for quick acquisition simply through 
observation; yet observation plus a fairly minimal amount of language “coding” 
relevant to it yields prompt mastery and generalization. This is a familiar function 
of language, of course, yet not a frequently researched one, especially from the point 
of view of developmental process. 

However, a failure of language to improve mediation reported in the past year 
may hold some surprises. Quay (317) reported that administration of the Stanford- 
Binet to 50 4-year-old black children “selected from an extremely deprived, physi- 
cally and socially isolated community” (p. 245), in either standard English or in 
Negro dialect, yielded no systematic differences. An analysis of item difficulty also 
displayed no differences. Thus an abstract cognitive task, highly dependent on 
language skills, was not facilitated by administration of it in the language forms 
presumably most familiar and most useful to these children, who were presumed 
to be severely disadvantaged and perhaps most susceptible to confusion by standard 
English dialect. 

Perhaps the most thorough view of language as a mediator in the past year’s 
literature (and many other years as well) was offered by Palermo & Molfese (302). 
Deliberately turning their attention away from the first 4 years, during which 
acquisition seems most rapid and dramatic, these authors pointed out that after age 
5, language continues to develop in increasingly complex and sophisticated ways, 
typically involving phonological, syntactic, and semantic components. Further- 
more, at each notable developmental stage subsequent to age 5, corresponding 
advances in language behavior have already been noted in the literature. Taken as 
a whole, these studies suggest that ongoing language elaboration remains integral 
to, perhaps essential to, the breadth of development yet to take place throughout 
the developmental span of the organism. The argument seems undeniable, and the 
comprehensive review and integration offered by Palermo & Molfese hopefully will 
serve to better distribute research attention across the age span and developmental 
levels available for the analysis of language. 


Theoretical Argument 


Early in the year, McNamara (272) presented a Psychological Review paper which 
may be considered a major essay in the theory of language development. His central 
thesis was that language serves meaning for the developing child only to the extent 
that meaning already exists. Thus if an infant can understand the meaning of 
another person, it is then possible to decode the language used by that other person 
and thereafter to use the language himself. It is not the case that meaning is derived 
from or built upon developing language skills, but rather that language is derived 
from and built upon developing meanings. McNamara’s prime assumption is that 
a distinction can be drawn between intention and language. It seems to follow that 
the study of language development is essentially a study of an increasingly bright 
code-breaker at work. For McNamara the theory is an outgrowth of earlier state- 
ments by Brown (40); however, it is meant to add to that statement an integration 
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of vocabulary, syntax, and phonology, and to add very recent findings in linguistics 
and psycholinguistics. In doing so, McNamara was less than satisfied with the result. 
He argued that to some extent the development of vocabulary, syntax, and 
phonology were interdependent, but not completely enough to assign simple se- 
quences to their acquisition: “... the interdepencencies are only partial, though it 
is important to appreciate that at present they are quite obscure” (272, p. 11). In 
summary, it was argued that a fair analogy can be drawn between the language- 
developing child and the linguist describing a new language, with the exception that 
the linguist is constrained to state rules and the child is not. At any rate, it is a 
language development theory which thrusts primary importance and priority on 
nonlinguistic cognitive development [in which context Piaget is cited approvingly 


(p. 11)] 


Developmental Psycholinguistics 


An article by Dan Slobin (365) in Psychology Tcday makes a strong plea for the 
universality of the processes of language acquisitzon, one that is rich in multicul- 
tural, multilingual comparisons, and one which wauld not be strongly contested by 
a verbal behavior theorist. In the presence of function words only, 4- and 6-year olds 
imitate grammatical sentences more accurately then ungrammatical ones (250). De 
Villiers & de Villiers (96) found that 2- and 3-year-3lds could grammatically correct 
and appropriately carry out imperative instructions that were made semantically 
anomalous, but that only the most linguistically acvanced Ss could do so when the 
commands contained syntactic errors (reversed word orders). The penetration of 
psycholinguistics into Genevan psychology is evidenced in a study by Sinclair & 
Bronckart (362) in which all orders of two nouns and a verb were presented to 
Genevan, French-speaking children from 3 to 7 years of age. A developmental trend 
was established toward interpreting the first noun as subject and the second as 
object, regardless of verb location. 

A promising new direction in linguistic research s indicated by a series of articles 
on the development of the language of emotions (241, 242, 432, 433). The authors 
attempt to study how children come to name their feelings, using a theoretical 
framework that contains elements of Freud, Piaget, Luria, and Lenneberg. 

Two studies attempted to elicit generalized syntagmatic or paradigmatic word 
associations in order to cast light on the developnental shift between these two 
modes of response. Routh & Tweney (337) used ins’ ructions, prompting, and exam- 
ples together with candy rewards to train kindergarteners and fifth-graders to make 
paradigmatic free associations. The general shift was obtained in that the older 
children made more paradigmatic responses in all conditions. Training produced a 
small increase in paradigmatic responding to verbs and adjectives at both ages, but 
a slight decrease among younger Ss in paradigma-ic responses to nouns. Francis 
(125) used a word-matching task and directed associations with 7-year-olds, but 
found that neither form-class knowledge nor sementic class knowledge was sufti- 
cient to account for the range of responses and ~easons therefore given by the 
children. Apparently we still have a phenomenon ia search of a verifiable explana- 
tion. 
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COGNITIVE DEVELOPMENT 


The research boom in cognitive development appears to have not yet reached its 
peak, and work stimulated by and addressing itself to Piagetian questions still 
dominates the field. Replication (and failure to replicate), extension to new tasks or 
materials and training studies account for nearly equal portions of the literature in 
1972. Various applications of Piagetian notions to testing, educational practice, etc 
are gaining in frequency. 

In the journal literature, two integrative theoretical essays are of interest. Kagan 
(200) addressed himself to the cognitive development of infants, summarizing a 
germinal series of studies that have led him to conclude that infant cognitive 
development is both more extensive and more maturationally based than most 
recent theorists have held it to be. Like several other infant laboratories, Kagan’s 
is finding the orienting response to be as widely useful in assessing infant information 
processing as its earliest proponents predicted it would become. The use of response 
decrement and recovery techniques together with the incorporation of state vari- 
ables and expectation phenomena within the general discrepancy (or adaptation- 
level) format have enabled Kagan to investigate such processes as hypothesis 
forming as early as 9 months of age. In fairness to other investigators, and in support 
of Kagan’s summary, it should be noted that numerous other investigators of infant 
attending have used essentially the same logic and methods and reached essentially 
the same conclusions. 

The second theoretical contribution is a very thoughtful and tightly reasoned 
monograph by Flavell (120) on the logic of sequential acquisition of competencies. 
Both requirements for developmental theorizing and criteria of methodological 
adequacy are discussed in terms of the establishment of sequential invariance of 
development, both between and within stages. The article should be required read- 
ing for researchers attempting experimentally to disentangle the skeins of develop- 
mental process. 

Smillie (368) takes Piaget to task regarding the notion of perceptual tableaux used 
to describe the experience of the young infant. Citing research by Bower, the author 
reinterprets Piaget’s evidence and proposes an initial period of visual realism, fol- 
lowed by a period in which the infant explores the range and consequences of his 
own actions. Only later are the two systems coordinated, according to this analysis 
of sensorimotor development. 

It should be noted that Piaget’s perceptual theory regarding the error of the 
standard has been strongly challenged by Begelman & Steinfeld (17) on conceptual, 
methodological, and empirical grounds. The authors offer new data in support of 
their conclusion that the operation of an error of the standard is not a required 
assumption for interpretation of any data thus far obtained. 


Replication and Nonreplication of Piaget 


Object permanence continues to receive the major share of attention in studies of 
sensorimotor development. Evans & Gratch (108) demonstrated that the stage IV 
error (looking for a hidden object where it was last found instead of where it has 
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Just been hidden) is made as often when the previcusly found toy is hidden in a new 
place as when a different toy is hidden in the new place. They reinterpret the stage 
IV error as simply a position preference. Webb, Massar E Nadolny (415) noted the 
information from early hiding trials produced pro-active interference on the first- 
choice responses on subsequent trials, indicating memory effects, but adequate 
self-correction on second choices by 16-month-olds. Fourteen-month-olds also 
showed pro-active interference but lacked a self-co-rection strategy. Both are appar- 
ently important contributors to object constancy performance. 

Classification and conservation remain the favorite skills of investigators of con- 
crete-operational thinking. The types and sequences of free-classification responses 
found by Piaget & Inhelder, and also those found by Vygotsky, were not fully 
replicated in two studies by Denney (92, 93). TasE requirements, instructions, and 
procedures appear to play a more important role than the early studies assumed, 
but even when the early procedures were replicated closely, discrepant results were 
obtained. 

Equivalence and identity conservation as different processes were studied in 4- 
and 6-year-olds by Elkind & Schoenfeld (106), whc found identity easier, especially 
for younger children. On the other hand, Moynahan & Glick (289) found that 
identity and equivalence co-occurred in number, quantity, and weight conservation 
tasks, and identity occurred first only in one leng-h-conservation task. 

Compensation, at least as assessed by verbal exp.anation, appears not to contrib- 
ute to the acquisition of conservation of liquid, since conservation appears first, 
according to Gelman & Weinberg (136). However, compensation is found differen- 
tially depending upon whether it is assessed by verbal explanation or by instructions 
to pour liquids in a compensatory fashion. 

Three- to 6-year-olds have a clear understanding of number invariance, at least 
for small numbers, though they may not show conservation of number on standard 
tasks, according to Gelman (135). The conclusior is supported by their surprise 
reactions to surreptitious number change as wel as by verbal explanations of 
changes in number, length, and density. Again the subtlety and ingenuity of Gel- 
man’s experiments are models of conservation research that ought to be more widely 
imitated. Pufall & Shaw (316) further clarified the distinction between “true” con- 
servation on equivalence problems and pseudoconssrvation on inequivalence num- 
ber problems. Either U-shaped or flat developmental curves were obtained, but the 
information processing of 3- and 4-year-olds appeared to be based directly on salient 
stimulus differences, while that of 5- and 6-year-olds-depended more on the relation- 
ship of spatial properties to number. Discrimination rather than conservation proce- 
dures were used by Siegel (357) to study number concepts in 3- and 4-year-olds. 
Oneness and linear magnitude (constant density) discriminations co-occur and ap- 
pear earlier than discrimination of nonlinear magni:ude. Brainerd & Brainerd (34) 
point out that differential difficulty data indicatinz that number conservation is 
easier than liquid quantity conservation do not necessarily establish that the former 
always develops prior to the latter. But they established that it does by within- 
subject ordinal analyses. 

The study of formal-operational thought is gaininz interest, especially in relation 
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to the roles of logic and language. Jones (196) demonstrated that sixth-grade boys 
of equal IQ but differing verbal ability and the use of tentative statements were 
equally skilled at hypothetical reasoning. Piaget will doubtless be pleased by these 
results. He is probably not as pleased with the results of a study by Bynum, Thomas 
& Weitz (47), who used the Inhelder-Piaget protocol on invisable magnetism and 
the original authors’ method of analysis, and found in that protocol evidence for 
only 8 of the 16 binary operations of truth-functional logic. Eight of the Inhelder- 
Piaget analyses were deemed faulty, and six of the operations were alleged never to 
occur or be needed in any complex reasoning tasks. Indeed those six operations can 
be performed more simply by various combinations of the other ten. Additional 
credence is lent to this analysis by the fact that the first two authors are logicians 
and logic teachers. Finally, a contribution to the understanding of negation as both 
a concrete and a formal operation was made in an excellent monograph by Feldman 
(114). 


Piagetian Innovations 


Grouped in this section are studies in which new concepts, tasks, materials, meth- 
ods, variables, or types of subject were studied more or less within the framework 
of Piagetian theory. Among new methods of developmental assessment are the use 
of a trick violation of object permanence to elicit varying amounts of puzzlement 
and searching in infants (233), and the use of movie presentation of conservation 
demonstrations to assess attainment of conservation (295). 

The ingenuity of Bruner and his colleagues was again demonstrated in the devis- 
ing of novel tasks, such as a lever to be used by 1- year-olds to rotate a turntable 
which thereby brought a toy within reach (223), or the use of seriated nesting cups 
to study the development of rulebound strategies in 1- and 2-year-olds (154). In both 
studies the now familiar outcomes of greater-than-expected orderliness and compe- 
tency and earlier-than-expected development of skills were obtained. 

The role of language in Piaget-type cognitive attainments continues to be studied, 
and not just at the symbolic and operational levels. For example, Sullivan (389) 
showed that children from 4 to 10 years old increasingly used constrained causal 
connectives and attended to the order of events presented in pairs of pictures. Ability 
to explain the temporal order as a clue to causal relatedness also increased with age. 
However, an experimental study will be needed to establish the contribution (as 
distinct from covariance) of language over the ages studied. Another language 
correlation was reported by Roberts & Black (330), who found that if children 
114 years old can follow an object through a sequence of invisible displacements (SM 
stage 6), then they tend to prefer and manipulate toys that are named by E to toys 
not named. 

Integration of Piaget-type and learning-theory-based conceptions is continuing to 
appear in the literature. Transposition learning was related to seriation competency 
in studies of size (358) and of happy-to-sad faces (211). Figurative vs operative 
knowing and the relationship between logical and perceptual knowing were studied 
in 4- to 9-year-olds in two contrasting studies. Overton & Brodzinsky (301) showed 
that while there are no direct influences of figural or perceptual features that enhance 
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two-way classification (conjunctive concepts) prior to the attainment of logical 
multiplication, there is a transitional period prior to the attainment of logical 
multiplication during which minimization of perceptual features enhances logical 
solution. However, in a simple drawing task with familiar objects, Freeman & 
Janikoun (126) demonstrated a developmental change between 7 and 8 years from 
intellectual realism (drawing what is known) to visual realism (drawing what is 
seen). Apparently the child can either be distracted by or corrected by visual cues 
more easily in a transition period than when he is intuitively certain (before) or 
logically certain (after). 

Finally, two innovative studies deserve mention simply because they extend Pi- 
agetian research into new domains. Rubin (340) confirmed Piaget’s prediction that 
children whose speech is less egocentric will be more popular with their peers, and 
thereby disconfirmed Vygotsky’s implication of an opposite relationship: that active 
and cooperative participants in peer interaction use more egocentric speech. As 
always in comparing Piaget with Vygotsky, the ccntention that their definitions of 
“egocentric” are equivalent and their theories arz in simple, direct opposition is 
somewhat strained. In a life-span developmental study, Papalia (303) demonstrated 
a progressive loss of conservation of substance, weight, and volume, but not of 
number in subjects over 65 years of age. 


Training Studies 


Recent training studies have generally gone beyond the simple contention that if 
training facilitates performance on Piagetian tasks, then Piaget is wrong, and if it 
doesn’t, he is right. This year’s crop provides some Dutstanding examples of ingenu- 
ity and subtlety in experimental acquisition studies. First consider the contribution 
of rules and feedback provided to the nonconserviag child. Siegler & Liebert (359) 
demonstrated facilitation of conservation of liquid cuantity, both in terms of correct 
answers and in terms of explanations, by either pres2nting relevant rules or complete 
feedback or both. The two treatments operated adcitively; a nonconservation series 
demonstrated that the facilitation was not merely a consequence of rote response 
acquisition; and the results were maintained a week later. Rosenthal & Zimmerman, 
(334) on the other hand, found that instruction did not facilitate conservation, and 
that providing a rule facilitated performance when both judged equivalence and 
verbal explanation were required, but not when judgment alone was required. 
Moreover, the same study demonstrated that observation of a conserving model 
alone and without further training facilitated conservation and generalization of 
conservation to new (unmodeled) stimuli. Praise cf the model’s responses had no 
effects, but a nonconserving model reduced the scores of initially conserving chil- 
dren. Peer-modeling effects were demonstrated by Murray (291) in an Asch-type 
experiment where one nonconserver and two conservers had to reach a consensus 
on standardized conservation problems. Gains in conservation judgment by the 
nonconservers were demonstrated in subsequent individual testing on the original 
problems and on new ones. Kuhn (229), on the other hand, failed to find strong 
effects of modeling of classification performances even when the model demon- 
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strated performances at various levels relative to the child’s own previously assessed 
level. 

Next we consider three studies, one on the role of compensation as an ingredient 
of readiness for conservation training, one on the effects of cognitive conflict in 
inducing conservation, and one on the role of true reversibility in acquisition of 
number conservation. Not only are they all training studies, but they all direct their 
modification efforts toward an alleged developmental prerequisite of conservation, 
rather than directly toward conservation performance itself. Curcio et al (82) found 
that 27 of 67 children tested for conservation of discontinuous quantity and compen- 
sation of height-width dimensions showed compensation but not conservation, 
whereas only one showed conservation without compensation. Training in conserva- 
tion produced gains for previously nonconserving compensators, as compared with 
untrained compensators and trained noncompensators. Post hoc, the degree to 
which compensators profited from training was predictable from the explicitness of 
their verbal justifications for compensation answers in the pretest. 

LeFebvre & Pinard (234) demonstrated acquisition of conservation of liquid 
conservation by the use of cognitive conflict based on compensation, addition- 
subtraction, or both, compared to a control group. On two successive posttests (2 
and 4 months later) all three experimental groups maintained gains over the control 
group but did not differ among themselves. 

A very ingenious procedure was used by Schnall et al (346) to distinguish, as 
Piaget does, between operational reversibility and mere empirical return to previous 
state as contributors to number conservation. Induction of conservation was not 
accomplished by three displacement procedures providing empirical return: (a) E 
spread elements and returned them by hand; (5) S did the same; and (c) S spread 
and returned them by a cardboard device. Induction was obtained, however, when 
S spread and returned the elements by stretching an elastic strip to which the 
elements were attached. The authors claim that the elasticity condition, because of 
the bidirectional tension involved, modeled the characteristic of operational revers- 
ibility, rather than merely empirical return, and therefore was successful. 

A deliberate application of individually programmed instruction to the acquisi- 
tion of multiple-classification skills was compared by Parker, Sperr & Rieff (305) 
with a terminal behavior training program. The individually sequenced instruction 
was based on a task analysis of prerequisite behaviors, while the terminal program 
trained criterion performances only, and a third group served as a contact control. 
Both training programs produced better improvements than the contact control but 
did not differ from each other. Six- and 7-year-olds benefited more from training 
than 5-year-olds. 


Formalization, Application, and Assessment Based on Piaget 


In this section are grouped a variety of studies which either formalize certain 
Piagetian notions, apply them in an educational context, or use them to assess 
individual differences in cognitive development. Case (54) has extended Pascual- 
Leone’s M, or “mental operator,” as the set measure of Piaget’s field of centration, 
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referring to the maximum number of activated scaemes that can be coordinated at 
any one time. Case demonstrates that by doing sone empirical curve fitting, he can 
confirm for a different task than Pascual-Leone hed used that estimates of M show 
an increase in value of one unit for every Piagetiar substage. Furby (131) presented 
a Piaget-based pretheoretical model for cognitive development, drawing heavily on 
Gagné’s model of cumulative learning sequences and stressing not maturational 
readiness, but acquisition of an ordered set of capabilities. Case (55) further linked 
Gagné’s model with Piaget’s by predicting (based on Pascual-Leone’s framework) 
that his experimental kindergartners would perfcrm at the same level as regular 
fourth graders on a verbal classification task and at the same level as regular second 
graders on a gestural classification task. The predictions were confirmed. This trend 
toward maintaining what has come out of Piagecian investigations as sequential 
invariance of acquisitions, while viewing with skepticism allegations of maturational 
necessity for that sequence, may indicate that cognitive developmental research is 
moving toward Gagné-like descriptions of cumulative learning sequences and away 
from the concept of development per se as a fundamental conception. On a more 
pragmatic level, Denis-Prinzhorn, Kamii & Moun»ud (89) have discussed various 
pedagogical implications of Piaget’s theory. 

In the domain of assessment, Pinard & Sharp (3 2) have provided a long-needed 
statement in English of the rationale behind (and some examples of) the Piaget- 
based intelligence test under development at the University of Montreal’s Institute 
of Psychology. The standardization of its 25 subtes’s or items on 500 children from 
4 to 12 is described. In a factor-analytic study Stephens et al (384) studied 27 
Piagetian reasoning variables together with achievement, and intelligence scores, 
MA, and CA. In general the Piaget items were loaded on different factors than were 
the standard intelligence and achievement scores. Kaufman & Kaufman (204) found 
that Lorge-Thorndike MA, a Gesell battery, and a Piaget battery, administered in 
kindergarten, all predicted Stanford Achievement Test scores at the end of first 
grade with positive correlations approximating .60 

The relation of Piagetian developmental status te MA and IQ is one of the most 
frequent questions raised by students and practiticners in both fields. One of the 
clearest examples to date is a well-designed study by Gruen & Vore (158) comparing 
familial retardates at MA 5, 7, and 9 years with no7mal children matched for MA 
and for CA. This highly quotable study used number, liquid, and weight conserva- 
tion problems and found that most of the variance in resulting performances could 
be attributed to MA and not IQ. Weight was more difficult than liquid, which was 
in turn more difficult than number, but exceptions were noted. Inequality (pseudo- 
conservation) problems were used as well as equality ones, which is another feature 
of the experiment that makes it a model for students. Like most investigators since 
the Mehler-Bever vs Flavell controversy, these investigators found that inequality 
was less difficult than equality at the ages studied. 

One final study comparing IQ (especially verbal) to Piagetian stage-wise develop- 
ment deserves separate mention because of its lorgitudinal design. Little (247) 
compared average and superior IQ children on thrze tasks each of conservation, 
seriation, and class inclusion at age 412 to 5, and repeated the assessment 2 years 


DEVELOPMENTAL PSYCHOLOGY 43 


later. IQ was associated with qualitative differences at both ages, but with stage 
differences only at the follow-up ages. The superior IQ children verbalized better 
at both ages: first in defense of their intuitive (and mostly wrong) responses, and later 
in explanation of their correct choices. Average IQ children were notably more 
dependent upon perceptual configurations at both ages. Unfortunately, the subjects 
were matched on CA rather than MA. 


Individual Differences: Styles and Strategies 


Studies of cognitive style and tempo, many stemming from Kagan’s work on reflec- 
tion-impulsivity, have become a popular subject variable in the study of cognitive 
development. In particular, attention to the information processing strategies differ- 
entially characterizing reflectives (or analytic responders) vs impulsives and the 
modifiability of styles have received increased attention. Following discussion of 
these studies, investigations of creativity, field independence, and a number of other 
cognitively evaluated individual difference measures are discussed in this section. 

Katz (203) used Kagan’s MFF test to identify reflective and impulsive preschool 
children, who then were given color- and form-matching tasks. Reflectives sponta- 
neously discriminated forms more than impulsives, though both could do so when 
prodded. In general the tendency to discriminate was related to form preference and 
proposed as an explanation of the greater form preference of reflective children. 
Denney (90) used second graders and the Kagan CST to identify analytic vs rela- 
tional responders. Exposure to reflective vs impulsive models or analytic vs rela- 
tional models change the styles and tempos of the children, and effects generalized 
to unmodeled tasks. Relational models produced more immediate change, while 
analytic models produced more durable change. 

A number of additional pieces of evidence support the general superiority of 
reflective children in information processing efficiency. Nuessle (294) found that 
reflective fifth and ninth graders were more proficient focusers in a concept identifi- 
cation task, that is, they were better able to eliminate irrelevant hypotheses. They 
also had longer latencies between feedback and next response, especially following 
negative feedback. Once again the inhibition of irrelevant response to make time for 
hypothesis revision appears to be their major advantage over impulsives. The visual 
scanning and search strategies employed by reflective and impulsive 9-year-olds on 
the MFF were studied by Zelniker et al (438) and by Ault, Crawford & Jeffrey (8). 
In addition to demonstrating that impulsives are poorer at sustaining attention than 
reflectives, they showed that the visual fixation patterns of impulsives and reflectives 
differ on the MFF. Moreover, when the logic of the task was reversed (find the 
different instead of the matching alternative) in a “DFF” task, subsequent MFF 
performance was altered. The main effect of DFF was an increase in comparison 
of alternatives and a decrease in looks at the standard during the DFF. This is a 
trend toward a pattern naturally characteristic of reflectives at this age. On the 
subsequent MFF, however, only the impulsives showed generalization of this shift 
in search strategy, and a reduction in errors among impulsives resulted. The second 
study included the subjects in the off-cells (fast-accurate and slow-inaccurate) who 
are customarily not included in studies of reflection-impulsivity. No difference was 
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found between comparision of standard and a variant and comparison of two 
variants, unlike previous studies, but both of these relatively systematic patterns of 
search were more common in reflective and fast-accurate 9-year-olds than in their 
impulsive and slow-inaccurate age mates. 

Adams (2) used the MFF to compare reflective and impulsive 6- and 8-year-olds 
on a probability learning task. Several patterns of guessing strategies were identified 
and defined empirically as more or less mature. As expected, for 6-year-olds, impul- 
sives showed less mature strategies than reflectives. Eight-year-olds, however, did 
not show such differences. 

One hypothesis about conceptual tempo states that the failure of impulsives to 
inhibit rapid and automated responding generalizes beyond the cognitive domain 
and characterizes their behavior in a variety of otter settings as well. The motoric 
inhibition of reflective and impulsive black middle-zlass preschoolers was studied by 
Harrison & Nadelman (174), who used the MF? and Maccoby’s Draw-a-Line- 
Slowly and Walk-Slowly tests, scored for both slowness (instructed) and irregulari- 
ties. Reflectives were better able than impulsives to inhibit motor movement, and 
the latter ability was positively correlated with MFF latency and negatively with 
MFF errors. Bucky, Banta & Gross (43), howerer, found no such relationship 
among 5-, 10-, 15-, and 20-year-olds, using the Maccoby motor tasks, the MFF, and 
an Embedded Figures Test. Motivational variables were considered important deter- 
minants of motor inhibition among the older Ss. Or the hypothesis that nonanalytic 
children on the CST are more anxious than their amalytic peers, Chiu (60) gave the 
CST to American and Taiwanese fourth and fifth g-aders together with the CMAS. 
No anxiety differences were found as a function of cognitive style for the American 
sample, but among the Taiwanese, nonanalytic children were more anxious than 
analytic ones. 

Conceptual strategies continue to interest developmental psychologists, and 
methods for their detection seem to become more subtle each year. Bruner’s distinc- 
tion between hypothesis-seeking and constraint-seeking strategies in a 20 questions 
game was used by Denney (91) in a strategy-model:ng study. Six-, 8-, and 10-year- 
old boys were shown videotaped models who demonstrated hypothesis seeking and 
two kinds of constraint seeking, one based on perceptible attributes, the other on 
functional attributes. Age was a determinant of susceptibility to modeling of strate- 
gies, but mere observation of the model was not in general adequate for acquisition 
of a previously unused strategy. 

Gholson, Levine & Phillips (140) used a blank-tr:als probe procedure after each 
trial of several discrimination problems to detect hypothesis testing strategies in 
kindergarten, second, fourth, sixth grade, and college students. Hypothesis testing 
appeared at grade two. Hypotheses used were classified as strategic (focusing, di- 
mension checking, and hypothesis checking) or stereotypic (stimulus preference, 
position preference, and position alternation). The kindergarten children showed 
mostly stereotypes; second to sixth graders used mostly strategic hypotheses, but 
never the focusing strategy, which is the most eflicien-. College students used almost 
exclusively focusing and dimension-checking strategies. 
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Analogous results were obtained by McKinney (270) using educable retardates 
at the MA 5-6 and 7-8 levels. Two strategies, conservative focusing and successive 
scanning were presented to two training groups, while a control group got no 
training. At the MA 5-7 level training in focusing did not facilitate conjunctive 
concept attainment, while scanning training increased efficiency but did not facili- 
tate concept attainment. At the MA 7-8 level both kinds of training facilitated 
concept attainment, with the focusing strategy being more effective than the scan- 
ning strategy. 

Coincidentally, two independent, five-year, longitudinal studies of creativity were 
published in 1972. Kogan & Pankove (219) retested in the tenth grade children who 
had been tested with the Wallach-Kogan tasks at grade five. Individual and group 
testing were used. Ideational productivity and uniqueness scores showed substantial 
stability over the interval for males retested by group administration and for females 
tested individually. IQ unrelated to creativity at grade five, remained so for females 
at grade ten, but became positively correlated for males. Cropley (81) had given six 
creativity tests to a sample of seventh graders, and followed them up 5 years later. 
Combining the tests into a single composit creativity predictor, he found correla- 
tions of .52 for boys and .46 for girls between that predictor and a composit 
achievement score based on extracurricular accomplishments in art, drama, litera- 
ture, and music. 

Witkin’s field independence, as measured by the Embedded Figures Test, was 
correlated with performance on the Stroop color-word interference test (fixity vs 
mobility) at five different age levels ranging from 10 to 86 years in a study by Eisner 
(105). Field independence is an inverted U-shaped function of age and is associated 
with mobility, while field dependence is associated with fixity. Fleck (121) used the 
Children’s Embedded Figures Test and a reversal-nonreversal task to identify field 
dependence vs independence and verbal mediation-nonmediation among kindergar- 
ten, first, and second grade boys. They were then given a variety of commercially 
produced Piagetian conservation tasks. Age and field independence were positively 
associated with conservational competence; verbal mediation was not. 

The last three studies to be cited in this section are concerned with personality 
and social behavior characteristics as determinants of individual cognitive compe- 
tencies. Crandall & Lacey (79) found that internal (vs external) control, defined as 
perception of academic success that is due to their own behavior (vs other causes), 
is related to speed and accuracy of Embedded Figures Test performance for girls, 
but not for boys, when age and IQ are partialled out. Ruble & Nakamura (341) 
compared field-dependent and field-independent children’s performance in a game 
during which relevant cues were sometimes provided by the experimenter. Accord- 
ing to the authors, the field-dependent children, because of their more social and 
less task-oriented approach, were better able to profit from the social cues given by 
E. The usual sex differences in field independence failed to have differential effects 
on attention to relevant social cues. In a study of the relationship between preschool 
social and emotional functioning and early school intellectual achievement, Kohn 
& Rosman (220) used rating scales to assess interest-participation vs apathy-with- 
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drawal and cooperation-compliance vs anger-defiance in 323 preschoolers. Interest- 
participation was correlated positively with schcol readiness and first and second 
grade achievement. For boys anger-defiance vs cooperation-compliance was not 
related to achievement, but anger-defiance was negatively related to intellectual 
achievement in girls, beginning at the second grede. Perhaps such findings should 
be discussed under the heading of teacher bias rather than cognitive styles. 

An efficient concept formation task originally developed by Huttenlocher was 
extended to younger children by Daehler (83). Im this two-trial procedure, a con- 
Junctive concept can be logically identified when there are two dimensions or proper- 
ties with two values each if two of the four instances are presented on trial 1 and 
S is told whether the positive instance is present, followed by a second trial in which 
one of the instances is reversed and the other is presented unchanged and S is again 
told whether the target instance is present or not. For example, if the stimuli are 
black or white and large or small, then a concept (2.g. large, black) can be identified 
in any of four different ways: it can be present in tne pair of instances of both trials 
(+ +), present on only one trial (+ — or - +), or present on neither trial (- A 
Typically - + problems have been shown to be essiest, since the novel element on 
the most recent (second) trial ss the concept. The — — problem has generally been 
the hardest, since the concept is never embodied in an instance actually presented, 
unless a matrix of all possibilities is also available to S, or the missing instance is 
by some other means logically indicated as the umtried possibility. By using faces 
differing in sex and miscellaneous features, Daehler was able to obtain better than 
chance solutions to all types of problems by 5- to 7-year-olds, and by 4-year-olds 
on the — + and + + problems only. A response bias favoring the cue added on the 
second trial and the cue present on both trials was implicated for preschoolers. 
When the concept was simply one of four different cues (not a dimensionally 
organized, 2 X 2 conjunctive concept), or when Oder children were used on the 
conjunctive concept, the - — problem requiring solution by negative instances was 
the easiest. 


SOCIAL AND PERSONALITY DEVELOPMENT 


A number of topics, potentially includable under this heading, have been excluded 
from review, including studies of adolescence, clinical summaries, case studies, and 
evaluation of various instruments and tests. Studies If the effects of birth order have 
also been eliminated because of the publication of e comprehensive review of such 
studies by Adams (1). The literature reviewed in this section is organized both in 
terms of subjects studied (ethnic, cultural, SES, and sex comparisons) and in terms 
of popular clusters of behaviors, traits, and influerces (attachment, achievement, 
moral development, and social communication). It 5 inevitable that certain studies 
included here deal as much with cognitive development as they do with personality 
and social development, and with increased concern for applied research in settings 
where whole children develop, we may expect more cf the traditional topical bound- 
aries to crumble in the next few years. 
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Racial and Cultural Comparisons 


Strictly racial comparisons of children’s performances in a variety of tasks are 
sometimes unfortunately confounded with SES differences, and the conclusions 
reached have correspondingly limited certainty. Slightly better are a number of 
studies in which SES is matched and held constant in order to make racial compari- 
sons, though the generality of results is often limited by the restriction in range of 
SES levels studied. The most conclusive studies, of course, systematically co-vary 
race and SES. 

The first group of studies to be considered have to do with racial attitudes per 
se. Cantor (51) developed a conflict paradigm for studying children’s associations 
between the terms “good” or "bad" and pairs of faces of white and black children. 
Choices and latencies of response consistently supported the hypothesis of stereoty- 
ping among white second graders and demonstrated the general utility of the 
method for nonverbal attitude assessment. McMurtry & Williams (271) demon- 
strated that training designed to weaken white children’s positive associations with 
light-skinned human figures and to strengthen their positive associations with dark- 
skinned figures generalized to reduce negative associations with dark skin. These 
and other similar results are discussed in the context of Osgood’s evaluation factor 
in the domain of affective meaning among preschoolers. Cantor (52) also demon- 
strated that the ratings of liking or willingness to associate with children is deter- 
mined jointly by familiarity and race. Familiarity enhanced the 9- to 11-year-old 
white children’s ratings of black but not white children’s pictures. 

Behaviors associated with cognitive competence were evaluated in a number of 
ethnic comparison studies. Zimmerman & Rosenthal (442) found that modeling and 
rule repetition facilitated concept attainment and generalization in both Mexican- 
American and Anglo-American fifth graders, the latter outperforming the former. 
Bucky & Banta (42) found that although black and white preschoolers differed on 
a test battery assessing motor impulse-control, reflectivity, innovative behavior, and 
curiosity (the whites doing “better”), the race of the tester was critical. For both 
groups of Ss white testers elicited better performances than black testers, and were 
rated from videotapes as providing a more positive social atmosphere, regardless of 
race of child. In a comparison of Canadian Indian children from a remote commu- 
nity with Canadian whites from several social and geographic groups, Schubert & 
Cropley (349) found that the lower WISC scores of the Indian group could be 
accounted for by reduced use of verbal regulation of behavior, which, in turn, varied 
as a function of contact with the white, urban culture. Taylor & Thweatt (393) found 
greater distortion in the Bender-Gestalt productions of Navajo 6- and 7-year-olds 
than in those of Anglos, but by age 12 there were no differences. 

Two studies made ethnic comparisons of performance on the Illinois Test of 
Psycholinguistic Abilities. Kirk (213) compared blacks, American Indians, and 
Mexican-Americans and found that blacks were superior in short-term auditory 
sequential memory, while the Indian and Mexican-American groups were superior 
in short-term visual sequential memory. Differing child-rearing practices are cited 
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to account for the obtained differences. Stephenson & Gay (385) compared black 
and white children matched at four SES levels and found that SES accounted for 
little variance in ITPA performance among the blacks, but that SES is related to 
scores of white children. No visual-motor vs audizory-vocal differences were found 
in this study. 

Two studies made ethnic comparisons on the age of achievement of Piagetian 
conservation. Gaudia (134) found that black children achieved conservation about 
a year later than Indian and white children (all lower SES), and that all lower SES 
groups are delayed in comparison to norms based on a racially and socially hetero- 
geneous population. Conservation was positively related to CA and [Q in all groups 
studied. Dasen (84) compared Australian Aborigiral children with Genevan whites 
and found that the order of acquisition was the same, except for a reversal of 
continuous quantity and weight, but that the Aborigines’ concepts developed more 
slowly. 

Black and white lower SES preschoolers and wh-te middle SES preschoolers were 
compared by Guttentag (164) in a study of spontaneous movement under a variety 
of controlled conditions. The black children emitted a greater number and variety 
of movement responses than did either of the wh te groups. 

A number of experimental studies treated ethnic group as a subject variable and 
examined its interaction with various treatment ccnditions, and these experiments 
appear to promise a more effective means of diseatangling and explaining ethnic 
differences than do the strictly comparative or normative studies. The effects of 
setting and procedural variables, experimenters anc observers, and stimulus matert- 
als used in comparing young, low-income, and mir.ority children with others were 
reviewed by Dickie & Bagur (97) in relation to larguage development. Digit span 
and short-term memory were used by McCarver & Ellis (266) to compare middle- 
SES whites and lower-SES blacks, both of which showed the expected effects of 
verbal rehearsal on short-term recali to the same cegree, in spite of the markedly 
superior performance of middle-SES whites in IQ. Sharp & Cole (354) compared 
West African children to previously reported Eurcpean and U.S. children on the 
developmental shift from syntagmatic to paradigmatic response patterns in word 
association. The shift occurs in a language structucally different from English, as 
predicted; paradigmatic responding is greater if the child has attended school; and 
the shift appears in general to occur later among West African children. Ciborowski 
& Cole (63) compared Liberian and American Ss varying in age and educational 
background in a disjunctive concept formation task. Ethnic differences found in the 
first experiment were eradicated in the second by zhe addition of a verbalization 
memory aid. Recall of printed language passages was studied by Nolen (293) as a 
function of the use of standard English, standard Negro dialect, and nonstandard 
Negro dialect. Second grade low-SES whites and Hacks did not differ from each 
other, but the whites recalled more at fourth grade. There were no effects of transla- 
tion of the materials into the dialects studied. 

The effects of adults, teachers, models, etc have ziso been studied in relation to 
socioethnic variables. Race of E was implicated in the Bucky & Banta (42) article 
cited above. In contradiction to their findings, Solko# (372) found that black exam- 
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iners elicited higher performances than white ones on certain WISC tests adminis- 
tered to black and white 8- to 11-year-olds. The other tests showed no differences 
attributable to race of E Black children scored lower than whites on those scales 
where differences associated with race of S were found, and race of child interacted 
with race of examiner on only one test. Liebert, Sobol & Copemann (245) found that 
black first- and second-grade boys imitated and recalled the choices of white models 
more than of black ones, and showed the same pattern of effects of vicarious reward 
and punishment to the model as had been previously demonstrated with white boys 
of the same age. White advantaged and black disadvantaged third and fourth 
graders were exposed by Portuges & Feshbach (314) to white female teachers 
displaying distinctive mannerisms and the use of positive vs negative reinforcement 
techniques. After observing the films, the children assumed the teacher role. The 
advantaged whites, the girls, and those who watched a positively reinforcing teacher 
showed the greatest imitation of her incidental behaviors. 

Contrary to the predictions of some students of the “disadvantaged” child, the 
enrichment of the preschool classroom with a variety of additional materials and 
equipment designed to stimulate perceptual and cognitive growth had no consistent 
effects on the 123 black children studied by Busse et al (46). The dependent variables 
included the Stanford-Binet, WPPSI (performance), and four scales from the ITPA. 

Black and white fifth-graders, half each of middle and lower SES, were given 
noncontingent feedback (success, failure, or no information) following a reading test 
by Friend & Neale (129). Then they were asked to attribute performance to some 
combination of ability, effort, task difficulty, and luck. White children judged that 
ability and effort were more important than difficulty or luck, while the reverse was 
true for blacks. Kagan & Madsen (201) used four experimental conflict situations 
to compare cooperation and competition among urban Anglos and rural Mexican 
children. No differences in motivation and ability to cooperate were found. The 
Anglos were more inclined to lower another child’s outcomes when no gain to self 
would result, but both groups would take a toy from another if they could keep it 
for themselves. Anglos were more conflicted when a peer displayed rivalrous intent, 
but Mexican children were avoidant of conflict. Similar results were obtained by 
Miller & Thomas (280) ma comparison of rural Blackfoot Indian with urban 
Canadian children. Under group reward conditions, both groups cooperated effec- 
tively. When reward was delivered on an individual basis, the Indian children were 
even more cooperative, while the urban children showed disruptive competition. 

Finally, a study by Baumrind (14) compared the patterns of parental authority 
and behavior of preschool children in black vs white families. The black parents 
appeared more authoritarian, but unlike their white counterparts, the most author- 
itarian of these families produced the most independent and self-assertive girls. 


Socioeconomic Status Comparisons 


Although obviously of importance in the preceding section, SES has been studied 
as the primary subject variable in the studies to be considered next. The first group 
contains correlational and survey comparisons; the second is composed of experi- 
mental studies. Tulkin & Kagan (402) found minimal differences between middle 
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and working class mothers in nonverbal interaction at home with their 10-month- 
old firstborn daughters, whereas every verbal behavior observed was more frequent 
among the middle-class mothers. Radin (318) observed fathers at home with their 
4-year-old sons, and compared middle- with lower-SES fathers on restrictiveness 
and nurturance and their sons on intelligence and masculinity-femininity. Over all, 
boys’ IQ was positively related to nurturance and negatively related to restrictive- 
ness. Middle-SES boys scored higher on IQ then lower-SES boys, and the latter 
showed a negative relationship between IQ and male sex role preference. Paternal 
behavior correlated with SES is concluded to be a more central determinant than 
other SES-related variables. Jones (197) compared high- and low-verbal ability 
fifth-grade boys matched on general intelligence. The high-verbal boys came from 
homes with greater opportunities for the use and. development of language, as well 
as from homes scoring higher on parental interaction rate, parents’ academic and 
vocational aspirations, and occupational status. 

Deprived (D) and privileged (P) Israeli preschoolers were grouped together in 
three preschools (mixed), while D-only children comprised a fourth. Free-play 
social interactions were observed at the beginning and end of the 2-year project. 
Deprived children in the mixed classes showed higher rates of social interaction than 
those in the D-only school, but in the mixed scaools both Ds and Ps interacted 
mainly within their own social group. The D-only children and the Ps in the mixed 
groups showed gains in cooperation with peers, while the Ds in the mixed groups 
remained very dependent upon adults (113). 

In a study of visual information processing speed, Bosco (31) used the tachisto- 
scopic masking threshold to compare middle- with lower-SES first, third, and sixth 
graders. The disadvantaged children required more time to process the same visual 
information, but the difference declined with increasing grade level. Spence (377) 
compared middle- and lower-SES preschoolers in four experimental and one control 
condition during a two-choice discrimination series. Control Ss were shown the 
correct stimulus after each choice response. Experimentals received verbal or non- 
verbal reward for correct responses or verbal or nonverbal punishment following 
errors. There was a large effect of social class favoring the middle SES group, but 
no main effects or interactions involving treatments. On the other hand, Figurelli 
& Keller (116) compared middle- and lower-SES black children ages 6 to 8 on 
conservation tasks before and after training (vs no training). Middle-SES children 
scored higher on both the pre- and post-tests then lower-SES children. Training 
facilitated improvement as a main effect in both SES groups, but lower-SES children 
required more training task repetitions to criterion than did middle-SES children. 

Fourteen months after completion of a 20-hour systematic conceptual learning 
program with preschool children from poverty bazkgrounds, Jacobson & Greeson 
(189) retested with the Stanford-Binet for retention of IQ gains. Not only were most 
of the gains retained in the follow-up study, but the original differential treatment 
effects (e.g. reinforcement only vs reinforcement p us additional information) were 
still discernable in the follow-up data. 
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Communication and Social Verbalization 


Two studies reported this year were concerned with infant vocalization as a function 
of social and acoustic stimulation. Dodd (98) presented 9- to 12-month-old infants 
with 15 min of prerecorded vocal, social-vocal, or social-only auditory stimulation, 
followed by 15 min during which their vocalizations were recorded. Vocal and 
social-only had no effects, but vocal-social stimulation increased the number and 
length of utterances containing consonants, though there was no evidence of imita- 
tion of the particular consonant phonemes presented. Webster, Steinhardt & Senter 
(416) presented 7-month-old infants with vowel sounds spoken at either high or low 
pitch. Fewer vocalizations were produced during high-pitch stimulation than either 
before or after than stimulation, and an increase in fundamental frequency occurred 
from baseline to high-pitch stimulation periods, but no decrease was observed from 
baseline to low-pitch stimulation. 

Three studies were concerned with the development of communication skills in 
children. Peterson, Danner & Flavell (309) found that both 4- and 7-year-olds 
reformulated an initial message when explicitly requested to do so, and both failed 
to do so when confronted only with nonverbal facial expressions of listener noncom- 
prehension. Only the 7-year-olds, however, reformulated messages in response to an 
implicit verbal request, such as “I don’t understand.” Shantz & Wilson (352) trained 
7-year-olds in trios (speaker, listener, and observer roles rotated) on tasks requiring 
verbal communication of critical cues to guide listener response. Peer discussion 
followed each of the six sessions. Trained Ss subsequently exceeded controls in 
communication of useful information and overall message quality and showed 
moderate transfer of communications skills. Hoemann (181) compared the quality 
and accuracy of peer communications in 8- and 11-year-old deaf and hearing chil- 
dren, the former using manual methods and the latter using spoken English. Poorer 
performances by the deaf children were attributed to experiential deficits which 
adversely affect communications skills. 

The likelihood of receiving a reply in response to a spontaneous verbal remark 
by one preschooler to another was studied in a free-play setting by Mueller (290). 
The most powerful predictors of whether a given communication would receive a 
reply were its technical quality and whether the speaker had the visual attention of 
the listener at its beginning. Nearly all communications aroused social interest in 
the listener and most of them received replies. A developmental trend was found 
by Solomon & Ali (373) characterizing which features of verbal evaluation state- 
ments are most important in the listener’s perception of objective meaning, recipi- 
ent’s consequent feelings, and speaker’s liking for the recipient. Positive, neutral, 
and negative levels of content and intonation were presented to subjects ranging 
from kindergarten to college age. Content was most influential for all three percep- 
tions for young children, followed by differential usage of content and intonation 
depending upon which kind of perception was asked for. The adults relied most of 
all on intonation for affective meaning, though content, of course, still determined 
objective meaning. 
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Sex Differences 


Corinne Hutt (188) makes a strong case for the bblogical determination of a number 
of sex differences in humans and other mammals. Physical growth and maturation, 
motor activity, and sensory capacities in infancy, mother-infant interaction, the 
onset of fear, social behavior of preschoolers, exp oration, play, creativity, and other 
intellectual functions are reviewed in terms of reuroendocrinological processes of 
sexual differentiation prior to birth. For example, it is argued that the female relies 
principally on linguistic skills in intellectual func:ioning and that the higher parent- 
daughter than parent-son IQ correlation typically found reflects this fact (together 
with the earlier maturation of the female) rather than greater genetic control of IQ 
in girls. Her review can be expected to produce ex:ensive replies and some argument. 

Females at ages ranging from 2 years to adulthood consistently outperformed 
males in the task of filling form boards with red, green, and blue disks in a study 
by Guttman & Kahneman (165). At each age the females scored higher (more like 
adults) in number of colors used, order of color placement, and integration of final 
pattern, although there were no instructions as tc correct or desirable responses. In 
a study of concept indentification in which availability of past correct vs past 
incorrect instances was the experimental variable. Pishkin (313) found that females 
outperformed males; availability of past-correct nstances facilitated performance; 
and older females (9 years) profited from availability of past instances, especially 
correct ones, more than their male peers and more than 6-year-oids. 

Broverman et al have previously stated the thesis that sex differences in cognitive 
abilities reflect sex-related differences in physio ogy. Their thesis has been once 
criticized and the criticism replied to in previous years. This year Parlee (306) 
renews the attack on the Broverman position in a more comprehensive and factually 
based manner. She criticizes their review of the literature on sex differences in 
cognitive abilities as being selective and arbitrary in characterizing those facts which 
it does include. She further points out that the phermacological studies they cite are 
based on “dubious cross-species analogies” and that they have made a factual error 
in citing evidence in support of their proposed mechanism by which sex hormones 
might affect autonomic activity. Clearly the debete is just beginning. 

Several studies this year have been concerned with sexual stereotypes, most 
notably those associated with body build. Lerner & Korn (238) asked chubby and 
average build males at three age levels (5-6, 14-15, and 19-20 years) to attribute 
items from a verbal checklist to pictures of an endomorph, a mesomorph, or an 
ectomorph, and then to judge the items as to 200d and bad connotations and 
applicability to themselves. Later Ss identified their own body build and indicated 
their preference for one physique. In general at al age levels Ss of both body types 
had a favorable view of the mesomorph, a very unfavorable view of the endomorph, 
and a somewhat less negative, but still unfavorable, view of the ectomorph. Staffieri 
(382) found analogous results for females aged 7 to 11 years who were asked to 
attribute adjectives to figures representing endom:orphic, ectomorphic, and meso- 
dysplastic female body builds. A common set of behavioral and personality descrip- 
tions associated with the body types was found az all ages: all adjectives assigned 
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significantly more often to the mesodysplastic body type were positive and all 
assigned differentially to the endomorphic type were negative. Clearly desirable and 
undesirable body-type stereotypes for one’s own sex are well established by age 5 
to 7 and reflect the rater’s own preferences. Both samie-sex and cross-sex stereoty- 
ping of both body build and facial expression were supported in a study of elemen- 
tary school children by Felker (115). 

In a study employing Brown’s IT Scale, Fling & Manosevitz (122) found low 
correlations between parents’ scores and those of their 4-year-old children. In gen- 
eral parents discouraged cross-sexed play interests in boys more than in girls, and 
parents of the same sex as their child reported more encouragement of sex typing 
than did opposite sex parents. A masculine bias in children’s responses to the 
standard “IT” card was also reported. 

Meyer & Sobieszek (276) used videotapes of 17-month-old children sometimes 
described as boys and sometimes as girls to secure ratings and judgments from 
adults, including sex-role items. While there was little evidence that attributed sex 
of child influences the judgments of tapes directly, the judges described a child said 
to be of their own sex as having more qualities, both male and female. 


Social Behavior of Infants 


Following closely upon the rediscovery of infants’ perceptual and cognitive capaci- 
ties has been the study of their preverbal vocal behavior and receptive speech 
abilities. Accompanying this second wave is increased attention to the nonverbal 
social behaviors of infants, especially crying, smiling, and laughing. The following 
section will then review studies of mother-infant attachment. One apparent trend 
in infant work is the tendency to treat the state of arousal of the infant as a datum, 
rather than as an annoyance to the investigator. Korner (221) discussed state as a 
mediator of infant stimulation, and Lewis (240) considered it as an index of infant- 
environment interaction. Both made strong pleas that state, like other individual 
differences, not merely be considered as error variance. Another trend is the use of 
well-developed visual fixation methodology to study infants’ social attending and 
social object permanence when animate stimuli are compared with inanimate ones 
(153). 

Dyadic interaction between adopted infants and their mothers was observed by 
Beckwith (16), who also obtained PARI protocols from the mothers. Responsive- 
ness of babies to the mother in social play was a negative function of how suppressive 
and critical the mother was, and was also negatively correlated with responsiveness 
to a stranger. Quantity of contact between mother and infant was negatively related 
to infant self-stimulation rate and to frequency of ignoring maternal social overtures. 

Crying of infants, a behavior usually studied under hospital or laboratory condi- 
tions, was studied normatively in the home setting over full days of tape recordings 
by Rebelsky & Black (321). Although the sample was small (10), the thoroughness, 
naturalistic setting, and duration of the study (1 week to 3 months) make these 
normative data useful. Contrary to a simple reinforcement model, Bell & Ainsworth 
(18) found that consistency and promptness of maternal response is associated with 
a decline in frequency of crying over the first year of life. Close physical contact was 
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found to be the most effective means of terminat ng crying. Development of noncry- 
ing modes of communication to the mother also is associated with maternal respon- 
siveness (both consistency and promptness) to infant signals. Korner & Thoman 
(222) compared vestibular-proprioceptive and Jose contact methods of soothing 
newborn infants. Unlike Bell & Ainsworth’s oder infants, they found that new- 
borns’ crying rates during and after soothing were sharply reduced by vestibular 
stimulation, while contact was less effective during and ineffective after the soothing 
period. The procedure used, unlike that of other studies, carefully controlled contact 
and vestibular stimulation both separately anc in combination, and the authors 
suggest that the vestibular component in uncontsolled studies of soothing by contact 
may be the controlling factor. Bell & Ainswortl, however, indicate that significant 
learning to be soothed takes place over the first year, so Korner & Thoman’s 
conclusion must for now be restricted to newborns. 

The Meaning of Smiling and Vocalizing in Infancy was the title of an APA 
symposium in 1970, belatedly published in 1972 (166). In it Watson (413) discusses 
smiling and cooing as social indicators that the child has discovered a contingency 
relationship between two or more recurring events in his environment. The signifi- 
cance of this process for the development of cognitive and social schemas is dis- 
cussed. Zelazo (437) emphasized the nonsodal determinants of smiling and 
vocalizing, such as moderate novelty in both sozial and nonsocial stimuli. McCall 
(264) sums up the cognitive position on these behaviors, reminding us that our 
definitions distinguishing smiling as an operant from smiling as social communica- 
tion, from smiling as an index of cognitive processes involving recognition and 
novelty are not so clearly distinguished by infan-s, but are components of an emer- 
gent perceptual-cognitive process. 

Finally, Sroufe & Wunsch (381) have traced developmental changes in classes of 
stimuli that elicit laughter during the first year cf life. Tactile and auditory stimuli 
are effective with younger infants and become less potent in the second half-year. 
Most visual and some social stimuli become inczeasingly effective in the latter part 
of the first year. 


Attachment and Mother-Infant Relationships 


The following studies are ordered by age of chiid and range from neonatal breast 
feeding interactions to stranger-reactions in the third year of life. The experimen- 
tally studied determinants of attachment usually pit the mother against a cognitively 
interesting but nonsocial stimulus, while the dependent variables are distress upon 
separation or following and approaching the mother. Thoman, Leiderman & Olson 
(395) compared breast feeding of 2-day-old infants by primiparous vs multiparous 
mothers and found that the former spent more t.me in nonfeeding activities; spent 
more time feeding male infants; changed activities more often; provided more stimu- 
lation, including verbal, and especially with feraales; and smiled more at female 
infants. In turn the primiparous infants spent less time attached to the nipple and 
less of that actively sucking. For primiparous mothers, the number of weeks the 
mother continued breast feeding was correlated positively with the amount of time 
the mothers spent talking to their infants at the 2-day-old feeding observation. 
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Premature infants who remain in the hospital for the first 3 to 12 weeks of life 
are subsequently smiled at and held close to their mothers’ bodies less than full-term 
babies in a study by Leifer et al (236). But within the premature group those who 
had full sensory contact with the mother during hospitalization did not differ from 
those who had visual contact only during that interval in the degree to which they 
were subsequently smiled at and held closely. Corter, Rheingold & Eckerman (77) 
found that the number and novelty of toys available in a strange room governed the 
promptness with which 10-month-old infants followed their mother upon her depar- 
ture from the room. 

Two studies by Coates, Anderson & Hartup (69, 70) investigated attachment 
behaviors at 10, 14, and 18 months of age. While touching and proximity to the 
mother were stable attachment indicators over both long and short terms, almost 
no stability over either duration was found for visual regard, vocalizing, crying, or 
orientation to the locus of the mother’s disappearance. Visual regard, touching, and 
proximity were mutually associated behaviors that increased after separation, 
whereas crying and orientation to locus of disappearance increased during separa- 
tion. 

Messer & Lewis (275) studied 13-month-old infants in a standard playroom with 
available toys in the presence of the mother and found SES and sex differences. 
Lower SES infants vocalized much less in these sessions than did middle SES 
infants, and lower SES infants showed smaller sex differences. Girls showed more 
attachment behaviors in general, vocalized more, and showed more mobility than 
boys. Wenar (420) confirmed that exploration of the physical environment takes 
precedence over initiating social interaction with the mother in a free choice situa- 
tion with 2-year-olds, but found no relationships among extensity, duration, and 
level of the alternate activities. 

An excellent monograph by Maccoby & Feldman (253) studied attachment be- 
haviors during the third year of life in American and Kibbutz-reared Israeli chil- 
dren. The two populations did not differ in general; disturbance over separation 
declined from age 2 to age 3; distal attachment behaviors toward both mother and 
stranger increased with age; but looking at them did not. At age 2 friendly interac- 
tion with a stranger was contigent on mother presence; by 3 it was not. 


Family Structure, Child Rearing, and Personality ` 


What used to be the hard core of correlational developmental psychology is repre- 
sented this year by a diverse group of articles. The child-rearing and familial 
antecedents of specific behavioral characteristics is simply no longer regarded as the 
chief business of developmental psychology. At least the articles discussed below 
will illustrate the heterogeneity of methods and concerns that characterize the field. 
Antecedents of social desirability response tendencies in 6 to 12-year-olds were 
studied by Allaman, Joyce & Crandall (3), using the children in the Fels longitudinal 
study. Using the same population, Battle & Lacey (13) studied parental antecedents 
and behavioral correlates of hyperactivity throughout childhood, adolescence, and 
young adulthood. 

Using fourth- to eighth-grade lower SES children’s reports of parental childrear- 
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ing behavior, Armentrout & Burger (6) found he same factor structure (for both 
sexes) as had been found earlier: acceptance vs rejection, psychological autonomy 
vs control, and firm vs lax control. Sex and age cifferences in factor scores were also 
reported. 

Little or no support for a sex-by-birth-order mteraction was found in a study of 
sex-role preferences and intelligence by Laosa & Brophy (232), in some instances 
because the expected birth-order differences werz not found. Other variables which 
did show sex differences did so more with first- than with later-born children, 
confirming that sometimes first-borns are more "horoughly socialized. Burton (44) 
studied sex-typing in Barbados children as a function of degree of father absence. 
Controlling a host of demographic variables did rot modify the fact that availability 
of male-role models did not systematically determine sex-role behaviors, while the 
status-envy hypothesis fared better. Both conflict 3f cross-sex identity and consistent 
feminine identification do occur in father-absent males in that culture. The critical 
period for absence appears to be the first 2 years of life. 

In a reversal of the usual design, Osofsky & C’Connell (300) studied the effects 
of the behavior of 5-year-old girls on that of taeir parents in a laboratory task 
varying in dependency producing features. The parents interacted more and were 
more controlling when their daughter was dependent. Mothers were more encourag- 
ing and fathers more directly helping under dep2ndency conditions. 

Sex and sibling status influence the effectiveness of child-teachers of concepts, 
according to Cicirelli (64). Males got higher concept attainment scores than females 
as learners, but female sibs were more effective tzachers than male sibs or female 
nonsibs, regardless of the sex of the learner. 

A study of still more applied significance was reported by Woods (434) on the 
unsupervised activities of fifth-grade ghetto children whose mothers were employed. 
A battery of psychological tests, teacher ratings, ard school and community records 
indicated positive influences associated with full-time employment of the mother, 
quality of mothering, mother-child relationships, and mother’s attitude toward 
employment. More girls than boys were unsupervised, and unsupervised girls 
showed impoverished cognitive development. 

Finally, it should be noted that a friendly deba-e is in progress between Burton 
(45) and Cohler (72) regarding the validity, utility, and interpretability of retrospec- 
tive accounts by parents and grandparents of chilc-rearing attitudes and behaviors 
in the study of intergenerational attitudes and changes in prevailing cultural values. 
The debate is constructive and a useful caution to both students of culture change 
and life-span developmentalists as well as to the nvestigators of child-rearing to 
whom it is directed. 


Achievement Motivation 


Concern with achievement and level of aspiration ñas focused heavily this year on 
the effects of evaluations by, and expectations of, significant adults. Achievement 
motivation was contrasted with fear of failure in 9- and ten-year-olds by Hermans, 
ter Laak & Maes (179) in a study of parent child nteractions during task perfor- 
mance in the home. Parents of high achievement-motivation children gave less 
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specific help, but more nonspecific help and reinforcement than those of low achieve- 
ment-motivation children. The high achievement-motivation children more often 
refused help than the lows. The parents of the high-anxious children responded less 
to the child’s insecurity, provided more negative tension release, and witheld more 
reinforcement than those of low-anxious children. The combination of high motiva- 
tion and low anxiety was characteristic of children whose parents had high expecta- 
tions of their child’s performance. Other correlations confirmed with Dutch 
children the behavior patterns established previously with American children asso- 
ciated with achievement motivation and fear of failure. 

Children who perceive themselves as responsible for their own degree of success 
(high accountability or internality) do better than others on spelling, vocabulary, 
and math tests on which they set their own level of difficulty, according to Clifford 
& Cleary (66). These relationships held at each grade level studied (fourth through 
sixth), and internality predicted performance better than IQ did for boys, while the 
reverse was true for girls. Experimentally varying internal vs external evaluation, 
Maehr & Stallings (255) found that under internal conditions children showed a 
continuing interest in working on difficult tasks, especially if they were high achieve- 
ment-motivation boys. Performance was not as systematically influenced as was 
interest in continuing. 

Two studies have shown still more effects of adult expectancy on children’s 
performances. Entwisle, Cornell & Epstein (107) found that positive expectations 
by a male principal following a first test facilitated performance on a second test for 
boys, but not for girls. In a marble-dropping task, Dusek (101) found that adult 
social reinforcing agents led to believe that the 9- to 10-year-old boys would outper- 
form the girls they tested did not produce E-bias effects favoring boys. The experi- 
menters told that girls would do better produced an E-bias effect, but only among 
low-anxious Ss. High-anxious Ss of both sexes were more responsive to reinforce- 
ment, regardless of E-bias. 

Finally, a correlational study by Cattell, Barton & Dielman (56) showed that 
using objective measures of motivation and factored personality tests, personality, 
motivation, and ability variables each contribute independently and significantly to 
school achievement tests administered 3 months later. The potency of the three 
types of predictors varied with the academic area assessed on the achievement tests 
given to sixth and seventh grade children. 


Moral Judgment and Behavior 


An offshoot of the Piagetian tradition is the study of the development of moral 
judgment, which has been generally studied independently of moral behavior. The 
latter, however, stems from an older theoretical concern with identification and 
behavioral! inhibition found in both psychoanalytic and social learning theory. The 
topics are reviewed in that order, followed by a discussion of the intersect of the two, 
which is beginning to receive some overdue attention in this most popular and 
relevant area of investigation. For this purpose considerable latitude has been used 
in the definition of what behaviors and judgments are considered to be "moral" 

Shantz & Pentz (353) asked 6-, 10-, and 13-year-old boys to indicate the amount 
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of punishment that aggressive boys in stories should receive, manipulating type of 
provocation, amount of retaliation in the story, and target retaliated against. Ten- 
and 13-year-olds found retaliation against sibs and peers less punishable than retalia- 
tion against fathers regardless of provocation, while 6-year-olds made such target 
distinctions only following physical provocation. All Ss indicated that retaliation 
matched to provocation (physical vs verbal) was more acceptable than mismatched 
retaliation if the provocation came from a father. But when physical provocation 
came from peers, they judged that verbal was preferable to physical retaliation. 

A similar method was used by Gutkin (163) to study moral judgments of fourth 
graders concerning the moral justifiability of stoties in which the intentions of the 
protagonist were either good or bad and the daraage done was either high or low. 
The judgments were scalable and yielded an apparent developmental sequence of 
four stages: (a) intentions irrelevant, damage dezermining; (b) intentions relevant, 
but damage more important; (c) intentions more important than damage, but 
damage still relevant; and (d) intentions alone relevant, damage irrelevant. 

Jensen & Rytting (191) used systematic story variation in a study of immanent 
justice attribution by second-graders. In each stery a misdeed was followed by an 
accident to the perpetrator that could be seen as impersonal! punishment (immanent 
justice).. The stories varied in whether naturalistic information was provided that 
could explain the real causes of the misfortune and whether the misdeed and the 
subsequent misfortune were presented as relatec. The more directly related were 
misdeed and subsequent misfortune, the less immanent justice was seen. The inclu- 
sion of naturalistic information only reduced immanent Justice responding in stories 
that also provided a direct relation between the two events. 

Merchant & Rebelsky (274) selected two grouss of kindergarten and first-grade 
children on age and sex, all of whom had reached Piaget’s stage of moral realism 
(morality of constraint) and none of whom had yet developed a consistent response 
characteristic of moral relativism (morality of cooperation) with regard to game 
rules. In the participation group each child mad: up rules for a new game, while 
his counterpart in the nonparticipation group was taught the rules made up by the 
child in the first group. As a check on this manipalation each child was then asked 
who made up the rules, and as intended the partidpation Ss reported that they did, 
while nonparticipation $ reported that E had made them up. Then all children were 
given a flexibility-of-rules test on which the participation group scored higher. 
Surprisingly, the first graders were not more flexinle than the kindergarteners, and 
a pre-posttest comparison of flexibility on anotker game showed declines for all 
- subgroups except the kindergarten participation group. 

Next are considered studies of moral behavicr defined as both production of 
prosocial behaviors and/or inhibition of antisocial behaviors. Although often some- 
what artificial and of limited ecological validity in laboratory studies, sharing and 
generosity with limited resources is the most popular prosocial behavior studied. 
Poulos & Liebert (315) found that modeling and verbalization by Æ each increased 
the number of shared tokens and the number of children who shared. Surveillance 
by an adult did not enhance the children’s sharing behavior, but it enhanced the 
positive effects of adult verbalization on that behavior. The particular standard of 
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sharing modeled or preached was not differentially adhered to as a function of 
treatment, though Ss exposed to it could recall it later. 

Solomon & Druckman (375) conducted an ingenious study in which pairs of peers 
age 7 to 9, 10 to 12, and 13 to 15 played a game in which the total earnings of one 
member A from a previous game were distributed between the previous player A 
and his partner B. The A members were designated as winners or losers depending 
upon whether they had won more or less than their opponent in the previous game, 
and the B members knew how the A members had fared previously. A members 
who had been losers the first game wound up with less money than B members in 
younger dyads, more than B members in older dyads, and about the same as B 
members in intermediate age dyads. 

Using essentially the same games, the first-game performances of A subjects who 
knew they would later share their winnings with a B partner (whom they repre- 
sented, in a sense, in the first game) played that first game more cautiously, persis- 
tently, and competitively than did Ss who did not represent a partner. This 
difference in responsiveness to role obligations held only for males, however (100). 

In a highly artificial situation Fouts (124) gave 10- and 11-year-olds the opportu- 
nity to pull a lever that delivered money into a box labeled charity or one that 
delivered the money into a neutral box. The children responded more to the charity 
box in general, and the presence of an observing adult had no significant effect on 
responding in either situation, contrary to literature-based expectations. 

On the antisocial side, Liebert and Baron (244) exposed 5- and 6-year-old and 8 
and 9-year-old boys and girls to excerpts from actual TV programs depicting either 
aggressive or nonagressive scenes, and they were then given an opportunity to 
allegedly hurt a peer. Frequency of aggressive behaviors in subsequent free play was 
also recorded. Children exposed to aggressive TV engaged in longer attacks against 
an ostensible child victim than those exposed to nonaggressive TV. The aggressive 
program also produced higher frequencies of aggression in free play, especially 
among the younger boys. 

Stein & Bryan (383) used a televised model who verbally encouraged either 
conformity or violation of rules concerning rate of self reward. In each model 
verbalization condition the model himself then either conformed with or violated 
the rules. The child’s verbal explanations were mainly affected by the model’s verbal 
stance, but the child’s behavioral conformity or violation depended upon an interac- 
tion between the model’s words and deeds. The greatest rule violation occurred 
when M was inconsistent, preaching violation but practicing conformity. The next 
greatest violation occurred when M was also inconsistent, but in the reverse direc- 
tion. Models whose words and deeds matched elicited still less violation by Ss, but 
more when M practiced and preached violation than when M practiced and 
preached conformity. 

Academic cheating in fifth graders was studied by Johnson & Gormly (192) as 
a function of the obviousness vs subtlety of the measure used to assess cheating. 
Obviousness of the measure used in previous studies accounts for the reported 
relationship between academic ability and cheating. While a main effect of sex was 
not obtained in this study, the interaction of sex with other variables suggested that 
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females were more likely to cheat when they were characterized as having unfavor- 
able self perceptions, while males were more irfluenced by immediate situational 
factors. 

Stouwie (386) had two adults give children corflicting instructions in a resistence- 
to-temptation task, one permissive and the other prohibitive. One adult was always 
dominant and the other nurturant. Most resistenze followed dominant prohibitions 
and most deviation followed dominant permissive instructions. All Ss increased in 
deviant responding over time. The role of maternal verbal communication style and 
content as an antecedent of resistance to temptation in Canadian and Italian families 
was analyzed in a monograph by Greenglass (155°. Cheating was higher in Canadian 
than Italian boys, but no difference was obtained for girls, and no overall sex 
difference was found. Cheating was not related to motivation to succeed or to actual 
earned scores. Italian mothers were more authontarian and restrictive than Cana- 
dian mothers with their sons. Canadian mothers used more reasoning with their 
sons. In both cultures mothers were more authoritarian and restrictive with their 
daughters than with their sons, and girls in both ultures who perceived parents as 
exerting control over them cheated less. 

In a study of Mexican-American children in g-ades 2 through 8, Solomon et al 
(374) assessed five democratic values by interv:ew and again behaviorally in a 
group-problem-solving and dart-throwing sessior-. Four of the values showed age 
increases when assessed verbally but not when assessed behaviorally. These were 
equality of representation, equality of participatior, assertion, and compromise. The 
fifth value, equality of resource distribution, was selatively high and invariant over 
ages when assessed verbally, while its behavioral index was the only one to show 
an increase as a function of age. 

Greenglass (156) found that altruistic lying (to protect another who has given the 
child unauthorized help in a prior task) increased as a function of age. 

Turiel & Rothman (403) pretested seventh and e:ghth grade boys with Kohlberg’s 
moral judgment interview, then offered them a choice between two actions. Prior 
to choosing, each S heard one choice supported by reasoning at the stage just above 
his own on the Kohlberg scale, and the other choice supported by reasoning at the 
stage just below his own. The S then made his ckoice and gave a rationale for it. 
A posttest showed no significant overall stage changes, but reasoning at a stage 
above AS es own led to a shift upwards for stage 4 children. 

A direct comparison of judgment and behavior was made by Saltzstein, Diamond 
& Belenky (343). Seventh-grade children were assessed by the Kohlberg interview 
and then tested for conformity in an Asch-type group-influence situation. Stage 3 
children conformed more than either higher or lower stage children, and high stage 
children made very few conforming responses. Stage 3 is an intermediate one 
characterized as mostly “good-boy, good-girl, approval-seeking morality.” Fodor 
(123) conducted a similar study relating Kohlber’s stages to susceptibility to influ- 
ence by others and to delinquency. A great many more such studies bridging the 
gap between the development of judgment and the adoption of behaviors will be 
needed (and are beginning to appear) before the relationship between these two 
domains is adequately clarified. A further bridge nzeds to be built between moral 
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behavior and judgment on the one hand and modeling and TV influences on the 
other, so that the practice vs preach variable in the latter studies can be coordinated 
with the behavior vs judgment relationship in the former. 


REJOINDERS, REBUTTALS, CORRECTIONS, AND CAUTIONS 


This section of the review will cite a number of the past year’s publications whose 
special characteristic is their attempt to correct an error in fact, logic, or method, 
or to alert the field to potential future errors of that sort. Certainly such reports 
could be summarized under the substantive topic headings to which they refer. On 
the other hand, they often represent a very special, quite valuable type of profes- 
sional behavior—-a sort of self-control mechanism for the field as a whole. Primarily 
for that reason, they will be treated as an area of their own. Since grouping by topic 
would accomplish virtually no grouping at all, the reports will be presented roughly 
in rank order along a dimension of specificity-generality, the most specific first. 

Sostek, Sameroff & Sostek (376) have summarized very carefully conducted, 
throughly controlled research on the conditionability of the Babkin reflex and 
concluded that it is not a conditionable reflex. The Babkin reflex is a gaping response 
of the infant’s mouth, accompanied by a turning of the head to midline while raising 
it (sometimes accompanied by eye closing and arm flexion). The response is elicited 
by a simultaneous pressing of the infant’s palms. Earlier studies had suggested that 
it could be conditioned to a CS of passive motion of the infant’s arm. Such studies 
lacked appropriate evaluation of the unconditioned effects of their CS over repeated 
trials and of the relationship of the infant’s state (alert vs drowsy) to the response. 
The present study included such measures. It found no evidence for conditioning; 
instead, there appeared a positive correlation between drowsiness and the probabil- 
ity of the response, and state measures indicated that infants typically grew drowsier 
as the “conditioning” session progressed. Furthermore it was found that some 
infants displayed the Babkin reflex as an unconditioned response to arm movement, 
rendering it thoroughly unsuitable as a CS. 

Warren (410) has noted that there is a general belief that African infants (defined 
as black peoples of sub-Saharan Africa) show precocious development in their first 
year of life; the belief is based on a number of studies measuring African infant 
characteristics and comparing them to test norms. However, studies which compare 
samples of African infants to similar samples of non-African infants, measured 
directly for purposes of comparison, do not show systematic differences. Warren 
suggests that better assessment techniques, more directly applied, are required for 
a final conclusion. Meanwhile, Lusk & Lewis (251) once again reported precocity 
for African infants; as Warren’s argument predicted, their comparison was to 
norms. 

Chandler & Greenspoon (58) on the one hand and Borke (30) on the other have 
presented some mutual suggestions for corrections of conclusions and terms con- 
cerning their research on egocentrism. Borke had previously presented data suggest- 
ing that children display empathy with others’ viewpoints as early as 4 years of age. 
Chandler & Greenspoon, in the cited paper, argued that if this were correct, it would 
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“force a major revision in the theories of Page and his co-workers” (58, p. 104). 
Chandler & Greenspoon then presented data based on a distinction between role- 
taking or perspective-taking skill and the more primitive ability to predict the 
feelings of others; their data suggested that wile the latter ability (presumably 
Borke’s “empathy”) might be seen in quite young children, the former skill (the 
critical one in denying egocentrism, in their opinion) was still not solidly developed 
by middle childhood. In reply, Borke (30) ackno=ledged Chandler & Greenspoon’s 
data and the definitional logic of their approack but argued against the utility of 
making the distinction they had. Instead, she suzgested that empathy be viewed as 
a steadily developing concept, one in which early manifestations were predomi- 
nantly predictions of others’ affect, but which imcreasingly became more complex 
until it encompassed the perspective-taking skills cited by Chandler & Greenspoon 
as the critical characteristic. In effect, the argument was largely resolved in terms 
of what the course of such development was; what remained for Borke were the 
social implications of the semantic attribution cf “egocentrism” to children until 
adolescence. Her argument was reminiscent of Eizznberg‘s on aggression: “What we 
believe of man affects the behavior of men” (104, p 124). Perhaps Borke was arguing 
that if the (agreed on) characteristics of young children are labeled “egocentric” by 
developmental science, the children will be mad= more so than they are. 

A recent book by Easton & Dennis (102), entitled Children in the Political 
System; Origins of Political Legitimacy, has obwous relevance for developmental 
psychology. Krause (226), however, has taken the authors to task. He pointed out 
that they have sought a psychological cause of arr general affection or respect that 
people may have for governmental and other autheritative social offices, but in doing 
so have strained both their data and their logic. Krause offered instead a psycholo- 
gist’s reconsideration and caution. 

Infantile autism has been a busy focus in the past year for rebuttals. Two such 
were inspired by Ward’s (407) postulation of an =ssentially environmental or psy- 
chogenic etiology of infantile autism. L’Abate (23C ) argued that Ward’s conclusions 
were derived from too selective a review of the relevant literature, and consequently 
neglected the possibility of more than environrzental factors interacting in the 
causation of autism. Furthermore, L’Abate hoped that newer treatment techniques 
would eventually produce a clearer definition of the entire syndrome, which inevita- 
bly would make arguments about etiology more >recise. Rimland (328) also ob- 
jected, arguing that Ward’s use of the term “autzm” was too broad and allowed 
consideration of irrelevant literature, that he hed misinterpreted some classical 
behavioral specifications of autism and that ther= were several recent references 
suggesting biological factors in the etiology of au: sm. In that both rebuttals gave 
strong emphasis to the widespread lack of agre=ment on how to use the term 
“autism,” it appears that many such arguments will continue to be arguments about 
definitional issues. 

Measurement problems were cited for several locas-of-control studies in a caution 
by Gorsuch, Henighan & Barnard (150). They pointed out that many locus-of- 
control scales are often used with children of low -erbal ability, and that while the 
reliability of such scales may amount to .60 with verbal fourth and fifth graders, 
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their reliability is not even susceptible of estimation in the case of younger or 
nonverbal children. Such scales, characterized by different reliabilities in different 
subgroups of an experimental design, will yield misleading data under either of two 
conditions: when the mean of reliably measured children differs from a random 
response level, or when the variable identifying the subgroups interacts with the 
target of the scale. Both conditions are usually met in locus-of-control studies with 
children of systematically varying verbal abilities. Although issue was joined specifi- 
cally with the procedures of certain locus-of-control studies, it is clear that the 
caution offered is a general one with many applications. 

Longitudinal vs cross-sectional approaches to the study of development were the 
subject of two cautions. Bergman (23) argued on technical grounds that longitudinal 
samples with repeated measurements yielded better results than independent sam- 
ples; his report also offered a regression approach to deal with the problem of large 
groups “dropping out” of longitudinal samples. Schaie (345) presented a more 
complicated caution in the same vein, specifically citing data from the Harvard 
growth study which, upon reanalysis, indicated serious limitations on their general- 
izability, even though estimated over such short spans of time as 4 months. 

The entire literature of birth order effects came under question twice in the past 
year, once with a possible reinterpretation, and once with a severe challenge to its 
existence as other than a statistical artifact. McGurk & Lewis (269), reporting some 
birth order effects with young children, found that second-borns sought more adult 
help and approval, spent more time in individual activity, were more talkative, and 
displayed negative affect more often than comparison children. However, these 
authors also found that their second-borns on the average were only 18 months 
younger than their older sibs, but 30 months older than their nearest younger sibs. 
Thus, they argued, mothers should have more time to attend to first-and third-borns; 
relatively, they deprive second-borns of attention and so contribute to their depen- 
dency. By contrast, Schooler (347) questioned the existence of the phenomenon, 
pointing out that if recently hypothesized biases in long-term population trends are 
taken into account, it may be the case that first-borns and second-borns are drawn 
from distinctively different populations and differ less because of their birth order 
than because of the different characteristics prevalent in populations choosing to 
have one child rather than more than one. It would seem that Schooler’s argument, 
while highly dependent upon some complex assumptions about population trends 
and attitudes about family size within different social classes, nevertheless is a 
familiar example of a general problem in biased sampling. No doubt the issue will 
recur in the future reports. 

Finally, Wozniak (435) cautions us not to misinterpret an entire source of re- 
search—Soviet studies of the development of verbal inhibition of motor behavior. 
The frequent failures to replicate such findings, noted in the literature, may be a 
result of misinterpretation of the usual Soviet methodology and of its underlying 
assumptions. In short, the caution suggests that in these cases, nationality correlates 
strongly with systematic psychophilosophical assumptions and a consequent meth- 
odology. 
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APPLICATIONS 


A total review of all child applications would te too lengthy to serve as a section 
of this chapter, nor would it be entirely appropriate; large sections of it would prove 
redundant with the chapters reviewing clinical and educational psychology. Never- 
theless, children are an extraordinarily important target of application; certain 
applications are virtually unique to children; amd some of those applications have 
direct or indirect implications for the concept cf development (as argued earlier). 
Consequently, a highly selective and very brief survey will be presented, beginning 
with the scholarly discussion offered by Atkinson & Paulson (7) on the general 
problem of translating learning theory into a cheory of instruction. Proceeding 
through a series of examples, these authors illustrated some generalizable proce- 
dures for deriving instructional strategies from learning principles. Significantly, 
their examples displayed the derivation of instractional sequences which take ac- 
count of the individual differences of their students, relative to the difficulty of their 
various curricular units. In general, they suggested an intersection between learning 
theory, computer technology, and programmed instruction, and some tactics spe- 
cific to that intersection which could lead to a radically different educational tech- 
nology in the future. The implications of such a development for children are 
immense. In that a developmental psychology is zonditioned by the culture within 
which the children it studies are socialized, and in that the culture’s schools may 
presently become a much more potent behavior-changing medium than at present, 
then future developmental textbooks may have to cite articles such as this as key 
historical points. 

At the same time that we cite careful data-based predictions about future child 
behavior problems, we must note the ongoing nonsolution of mary current ones. 
Spinetta & Rigler (380), for example, have gathered together the literature dealing 
with child abuse, especially that descriptive of the abusing parent. They point out 
several behavioral (or potentially behavioral) characteristics of the abusing parent, 
and thereby help to define a problem for modiication. In general, their survey 
suggests that abusing parents generally come fron deprived environments, possess 
incorrect beliefs about the nature of child rearing expressed in various malpractices, 
and have generally high rates of improperly discrzminated aggression of dangerous 
topography. Socioeconomic stresses were conside-ed contributing but not essential 
factors in the parent’s final descent into abuse. 

Nevertheless, experimental application has proceeded into a numter of fairly new 
settings. For example, Doke & Risley (99) and L=Laurin & Risley (237) reported 
two more details in an ongoing chain of experimental analyses of day care tech- 
niques. The first study demonstrated the conditicns necessary for maintenance of 
optimal child involvement when their activities ar2 to be prescribed and scheduled 
rather than allowed as free choices, and specified tactics allowing two teachers to 
maintain optimal child involvement even with = precisely specified sequence of 
activities. The second study compared two staffing patterns useful in accomplishing 
the transition from lunch through bathroom routines and into nap time, using the 
degree of child participation in the activities as tae criterion. A pattern in which 
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teachers staffed areas (the “zone” procedure) was found clearly superior to one in 
which teachers conducted specific groups of children through the sequence (the 
“man-to-man” procedure). 

A number of studies have pressed experimental applications directly into homes, 
placing the techniques of child management in the hands of parents. Christophersen 
et al (62) taught two sets of parents to set up token systems within their homes and 
use them to ameliorate a lengthy series of child behavior problems (household 
chores, personal grooming, backtalk, etc). In a very similar application, Hall et al 
(167) taught four parents to use various behavior modification techniques (several 
forms of reinforcement, mild punishment, and extinction) to improve such behav- 
iors as wearing correct orthodontic retainers, helping with household tasks, stopping 
crying, whining, and complaining, and getting dressed on time. Each parent was 
instructed to conduct an experimental reversal of the techniques used and then to 
recover the desirable behavior by reinstating the experimental practices. Herbert & 
Baer (178) taught parents to manage their social contingencies in a generalized 
correct manner, attending almost exclusively to all classes of desirable behavior by 
their child while ignoring almost completely all classes of undesirable behavior. The 
technique used involved the use of a counter on which the parent recorded all 
instances of attention to desirable behavior. (Recording errors—i.e. instances of 
attention to undesirable child behaviors—proved not to be an effective technique.) 
Those parents who profited by the technique seemed to be stable in their new 
practices after its discontinuation. 

Clark, Burgess & Hendee (65) moved child behavior modification into the na- 
tional parks and forests. They set up a reinforcement system for children in a 
campground, such that the children could eventually choose from a variety of 
readily available reinforcers. The responses required were the collection and proper 
disposal of litter in the campgrounds of the park. The experimenters went so far as 
to plant litter as part of the experiment to aid in the evaluation of the children’s 
efficiency. The technique appeared to be thoroughly effective in recollecting such 
litter. On a simple cost basis, it may represent an attractive solution to littering in 
such settings. 

Ritschl, Mongrella & Presbie (329) took a simple contingency in the school bus 
to maintain in-seat behavior (pointing out that out-of-seat behavior is dangerous in 
a moving bus). Their subjects were an unruly group of retarded (educable) children 
ranging in age from 6 to 15. Rock-and-roll music was played as long as the bus was 
moving and as long as no child was out of his seat. Each observed instance of 
out-of-seat resulted in 5 sec interruptions of the music. A marked effect was noted 
and experimentally analyzed as due to the new technique by experimental discon- 
tinuations of the procedure. (It is not yet clear, of course, whether the stimulus 
would work equally well with children in general, or if it is specific to retardates.) 

School applications are probably the majority case, but only two will be cited, 
chosen for their relevance to the perceptual development of children. Halliwell & 
Solan (168) have reported a perceptual training program for reading skills, suitable 
for first grade. They evaluated its effectiveness by comparing a group of children 
exposed to it with other groups given traditional (nonperceptual) reading instruction 
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as a supplement to their regular first grade curriculum, or else given no supplemental 
instruction at all. The perceptually based supplemental program led to better read- 
ing (primarily among the boys exposed to it) than. was seen in the other groups. The 
authors pointed out that perceptual training for reading is largely a result of the 
influence of developmental psychology. In the same vein, Tawney (392) presented 
a much more molecular analysis of some perceptaal responses involved in reading, 
specifically, letter discrimination. His training prcgram was based on an analysis of 
presumably critical features of letter shapes which facilitate their discrimination 
from one another. Using a match-to-sample paradigm, Tawney reinforced some 
children for discriminating and matching the critical features (presented divorced 
from letters) and others for responding to presumably noncritical features; in addi- 
tion, he utilized a control group who experienced neither program. A posttest of 
letter discrimination ability was then administerec. All groups improved over their 
pretests; but children who had received discrimimation training in the presumably 
critical features of letters did better than the o-hers (including those who had 
undergone discrimination training in supposedly noncritical features). A detailed 
analysis of the children’s errors gave some indications of the nature of letter confu- 
sions. Although letters are highly arbitrary visual stimuli, a successful program 
promoting their discrimination may well have implications for visual-perceptual 
development in general, at least in terms of a madel of analytic procedure. 
Generalization of desirable behavior changes, from the remediating environment 
to all other settings in which the child operates, is an enduring and difficult problem 
for application. It has been recognized as such onlyin recent years; more important, 
it has begun to appear as another item on lists cf behavior to change, as if “to 
generalize” were a response in itself. Consequentlr, techniques for developing “to 
generalize” have begun to appear; eventually, it seems, a technology of generaliza- 
tion will result. The past year’s research produced -wo notable examples. Johnston 
& Johnston (193) found that they had developed an effective token-mediated rein- 
forcement system for improving severe articulation problems in the speech of young 
children in a classroom setting. However, generalization of the improved articula- 
tion outside of the training setting was poor. A technique was developed which 
succeeded in producing generalized correct articulation. Children were trained in 
pairs, such that one child was a discriminative stimulus for the correct articulation 
of the other, and vice-versa. To the extent that the children shared many environ- 
ments other than the training setting, good articulatDon became generalized. Clearly, 
there remain several inviting problems, such as how to generalize control from a 
particular child to children in general and people in general. Walker & Buckley 
(406), coping with the same basic problem, evalua‘ed several different techniques 
for obtaining generalization after a remedial experieace ended. Their basic situation 
revolved around a remedial classroom, operating om a token reinforcement system, 
which produced desirable classroom behavior in difficult subjects. When the subjects 
returned from the remedial classroom to their regular classrooms, generalization 
was not always sufficient. Four groups of subjects were compared: a control group 
of remedial classroom graduates was simply returned to the regular classroom; 
another group was subjected to peer-reprogramming (in which children were taught 
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to apply appropriate contingencies to the subjects in the regular classroom); others 
returned to a classroom in which many stimuli had been equated to those encoun- 
tered in the remedial classroom; and the final group returned to a teacher who had 
meanwhile been subjected to training in behavior management techniques. At least 
some generalization was apparent in even the control group; but the two groups 
encountering equation of stimuli between the two settings and peer reprogramming 
did much better than that. Teacher training, however, was associated with no better 
results than were apparent in the control group. The peer reprogramming procedure 
was somewhat reminiscent of the Johnston & Johnston technique (just discussed), 
although more elaborate. In general, the two studies suggest practical and effective 
use can be made of peers, both as discriminative stimuli and as sources of reinforce- 
ment for behaviors to be generalized. 

Finally, it is pertinent to note a critical comment on one aspect of child behavior 
modification and a reply to it. Late in the year, Winett & Winkler (427) presented 
a review of school behavior modification applications, and concluded with an indict- 
ment succinctly stated in their title: “Current behavior modification in the class- 
room: Be still, be quiet, be docile.” The conclusion was derived from a survey of 
the research reported since 1968 in the Journal of Applied Behavior Analysis (a 
prime outlet for such studies). That survey indicated that inappropriate classroom 
behaviors invariably seemed to be those that were disorderly, loud, or disobedient 
of the teacher. It was suggested that behavior modification applications might well 
turn themselves toward different goals, e.g. the maintenance of “open” classrooms. 
A reply by O’Leary (299) accompanied the Winett & Winkler review. O’Leary 
suggested that their indictment was based on selective reviewing; he then presented 
a series of examples in which behavior modification techniques had been used for 
innovation and change rather than maintenance or strengthening of the status quo. 
Furthermore, O’Leary questioned whether the “open” classroom (in which children 
presumably sing, laugh, whistle, etc) was in fact known to be a desirable or effective 
learning environment for certain children, especially those with large social and 
academic deficits. However, O’Leary joined Winett & Winkler in warning against 
indulgent alliance with any request to change children’s behavior by any representa- 
tive of the educational establishment. In general, it would seem that the issue hinges 
very largely on knowing at the level of facts what behaviors lead to what conse- 
quences in classrooms. Unfortunately, the field of child applications is remarkably 
free so far of any empirical evaluations of whether education, socialization, or any 
other aspects of development proceeds better in quiet, orderly, obedient classrooms 
than in unruly, noisy, permissive ones. This situation, it appears, is about to change; 
factual investigations seem on the verge of appearance. Meanwhile, there is an 
abundance of opinion on both sides of the issue, so strong as to take on the appear- 
ance if not the substance of fact. 
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There is no surer prescription for a prolonged bout of humility than to accept an 
assignment to review a well-defined, neatly circumscribed area of psychological 
research. When the area has no clear demarcations, no discernible beginning or end, 
and can include virtually everything or almost nothing, humility becomes leadened 
with despair. The project was partially rescued by the editorial wisdom of a merci- 
fully small page allotment, but its final survival required a number of more or less 
arbitrary decisions regarding topics to be covered in the review. Conditioned appeti- 
tive drives were included because the topic had not been reviewed in this series 
recently and because of the opportunity presented by such a review to relate tradi- 
tional work in this area to highly related research efforts that in the past have 
dangled far apart. Conditioned reinforcement seemed a natural enough followup to 
conditioned appetitive drives, and in addition some interesting recent developments 
could be recounted. Por some reason the “contrafreeloading” phenomenon—the 
fact that animals often work for food when free food is available—has escaped 
attention in past reviews though it seems quite relevant to the topic at hand. 
Solomon & Corbit’s (129) opponent-process model of motivation is treated in some 
detail because it is a very recent development and because of the continuing dearth 
of fresh, sensible, and suggestive models of acquired motivation. The last section is 
a brief digression which acknowledges certain current upheavals in the field of 
learning and, in general terms, assesses their significance for developments in ac- 
quired motivation. 


CONDITIONED APPETITIVE DRIVES 


As pointed out by Brown & Farber (24), the possibility of conditioning such basic 
need states as hunger and thirst has long entertained psychologists interested in 


‘Preparation of this chapter was supported in part by National Science Foundation Grant 
GB-24386X. 

“The following abbreviations are used without prior definitions: CS (conditioned stimulus); 
CR (conditioned response); US (unconditioned stimulus); UR (unconditioned response); $+, 
S- (positive and negative stimulus, respectively, in a discriminative task). 
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acquired motivation. No purer realization of the Etter concept seems possible than 
to show that, through appropriate conditioning techniques, hunger and thirst can 
be produced in an organism at will. However, joining Cofer & Appley (33) and 
Bolles (17), Brown & Farber concluded from their review that “the reality of the 
phenomenon (conditioned hunger) remains to b2 established” (p. 107). Perhaps 
because of this consensus of skepticism—treinforced by a subsequent review and 
methodological analysis by Cravens & Renner (35}—as well as a general disenchant- 
ment with the concept of drive itself (3), the topic of conditioned appetitive drive 
vanished from the subsequent two reviews of derivzd motives (3, 19). Largely owing 
to the work of Seligman and his associates (95, 115, 117, 119), the subject now 
appears to deserve at least a temporary reprieve. 


Conditioned Thirst 


Where previous investigators managed to produce at best fragile evidence for condi- 
tioned hunger or thirst (35), Seligman and associates have reported conditioned 
drinking in massive amounts: 50 to 1000% increases in drinking observed in virtu- 
ally all of the 150 or more rats tested by them (119). Moreover, in sharp contrast 
to the evanescent character of earlier evidence of appetitive conditioning, they were 
able to produce conditioned drinking that was often virtually “inextinguishable.” 
The conditioning technique employed by Seligman and associates is simplicity itself. 

During a series of baseline sessions rats, deprived.of water for 1 hour, were placed 
into a distinctive test box in which water was readily available. There followed a 
number of conditioning sessions in which a solution of hypertonic saline (15%) plus 
procaine (2%) was injected subcutaneously, and afer some 15 minutes, the animal 
was placed into the test box for 45 minutes. Subcutaneous injections of 15% saline 
solutions are known to induce avid drinking in a matter of minutes (80, 141). 
Procaine, an anesthetic, was added in an attempt to alleviate the irritation caused 
by hypertonic saline injections (140, 141). It is important to note that during these 
conditioning sessions water was not available in the test chamber. At the termina- 
tion of conditioning, the animals entered an “extinc’ion” series during which, again 
1 hour water-deprived, they were placed into the test chamber with water available 
but with no prior injections. If thirst had been conditioned to the cues of the test 
chamber as a result of the conditioning experience, the animals should consume 
more water during extinction than they had durirg the baseline sessions, which 
indeed they did. 

It was not very long, however, before this pleasantly straightforward situation was 
disturbed by unforeseen complications. First, substantial conditioned drinking was 
produced by 1sotonic procaine injections (117; cf. 32). And though there was some 
indication that the combination of hypertonicity and procaine yielded more condi- 
tioning than procaine alone (117; cf. 34), it must have come as a surprise that 
unadulterated hypertonic saline injections yielded little, if any, evidence of condi- 
tioned drinking (119). Moreover, hypothalamic in ections of angiotensin, which 
produce rapid and marked unconditioned drinking (27), acted like hypertonic saline 
injections in producing at best only evanescent conditioned drinking (119). 
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The interpretation of this complex pattern of results offered most recently by 
Seligman and his associates is that there are two different mechanisms responsible 
for the conditioned drinking observed in their experiments. On the one hand, there 
is the classical conditioning of “real” thirst. “Stimuli paired with the very rapid 
induction of thirst caused by angiotensin injected into the hypothalamus can them- 
selves later produce increased drinking” (119, p. 119). The conditioned drinking 
based on this mechanism, however, is small in extent and quick to extinguish. The 
really striking conditioned drinking obtained in their experiments (cf. 139) is at- 
tributed not to classically conditioned thirst, but to operant conditioning of the 
drinking response motivated by the procaine component of the injections, which 
presumably acts as a “poison.” The assumption is that the procaine causes physical 
distress which is somehow relieved by ingestion of water. Because in their early 
experiments (115, 117, 119) water was not available in the test cages during condi- 
tioning sessions, it must be assumed that the animals learned the connection between 
water ingestion and alleviation of the procaine-induced distress by drinking water 
available in the home cages. In any event, the proposal is that “If the rat learns to 
anticipate illness and also learns that drinking relieves or prevents illness, it should 
continue to drink during extinction in order to prevent that illness” (95, p. 384). 
Going one step further, such conditioned drinking is extremely resistant to extinc- 
tion because drinking in this situation is essentially an avoidance response, and the 
parallel is drawn to the (alleged) high resistance to extinction of more conventional 
avoidance behavior (cf. 36, pp. 358-9). 

It appears that the evidence for “true” thirst conditioning is no less equivocal in 
the present series of studies than in earlier attempts. Although a small amount of 
conditioned drinking was reported when hypertonic saline solutions alone were 
employed (119), this result was marred by the fact that drinking was allowed during 
the conditioning trials. Moreover, in the very same report the authors state that “We 
have replicated this experiment without water available in the test cage during 
injections and have not found conditioning in the hypertonic groups” (119, p. 113). 
Similarly, the conditioned drinking produced by angiotensin injected into the lateral 
hypothalamus (presumably mediated by conditioned thirst) enjoyed the benefit of 
conditioning sessions in which water was freely available. And even then, as noted 
earlier, only a relatively small amount of conditioned drinking was observed, which 
dissipated quickly. Whether any conditioned drinking would occur if water were 
denied during the conditioning sessions remains to be determined. 

In a recent study, unfortunately lacking the controls necessary to be totally 
convincing, the suggestion was advanced that hypertonic saline-procaine injections 
result in a conditioned “nonspecific increase in motor excitability” (34, p. 219), 
which, if circumstances allow, can trigger eating and running as well as drinking. 
This is a significant methodological point. Our confidence that drinking was selec- 
tively aroused by their earlier conditioning experience would doubtlessly increase 
if, during extinction, Seligman’s animals had been provided with the opportunity to 
engage in dominant alternative activities. 
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Conditioned Hunger 


Typically, the conditioning of hunger has been attempted by subjecting a rat to some 
sort of food deprivation procedure and then placing the animal in a distinctive test 
chamber, on the assumption that by this procedure (backward conditioning?) hun- 
ger would somehow become classically conditioned to the cues of the test chamber. 
In a very recent attempt, the evidence obtained fo~ conditioned hunger again shows 
that, the use of modality-relevant CSs notwithstanding, “it is at best a rather 
ephemeral effect” (94). 

A somewhat more direct approach makes use of the observation that in man and 
animals exogenous insulin induces both hypoglycemia and hunger (21, 23, 64, 66). 
Since hypoglycemia can apparently be classically conditioned (2, 72, 148-150; but 
see 57, 122), the possibility of classically conditicning hunger by means of insulin 
injections immediately suggests itself. The indicat2d experiment, similar in concep- 
tion and design to the conditioned drinking studie», was performed by Balagura (9), 
who found that rats subjected to daily insulin injections during a conditioning series 
showed augmented lever pressing for food in a later extinction series in which 
normal saline was substituted for insulin. Like the conditioned drinking reported for 
hypertonic saline and angiotensin, the conditioned eating observed in this study was 
short-lived. 

Noting that Balagura allowed his subjects to werk for food during the condition- 
ing sessions, Siegel & Nettleton (123) later showed that augmented bar pressing did 
not occur in extinction if the animals had been denied the opportunity to lever press 
for food during the conditioning trials. They concluded that classical conditioning 
of hunger had not been demonstrated by Balagura, that the observed conditioned 
lever pressing and eating was an operantly conditioned “escape or avoidance re- 
sponse to the aversive stimulus of hyperinsulinisrı” (123, p. 391; cf. 86). 


Conclusions 


Despite continued and sometimes ingenious experimentation the case for classically 
conditioned appetitive drive states is still unimpressive. In part the problem lies in 
the vexing ambiguity of the concepts of hunger and thirst (50). Even a limited 
perusal of the relevant literature is sufficient to drive home the point that a large 
number of factors determine the eating and drinking behavior of animals and 
humans (26, 47, 49, 52, 74, 81, 98, 109, 131, 133, 151, 152). What subsets of these 
factors are to enter into the definition of hunger and thirst have never been ade- 
quately specified. As a consequence, it is not at aM clear what, if anything, would 
constitute ironclad evidence for the conditioning of hunger or thirst, especially in 
animals. 

“We agree with ... prior reviewers and believe that the term drive has been 
bandied about to the point where it is not only useless but is actually meaningless 
and confusing .. . [It] serves no scientific purpose and should be abandoned” (138, 
p. 54). Inasmuch as a scuttled ship takes down hull and superstructure alike, 
perhaps the terms conditioned hunger and conditioaed thirst ought to share the fate 
of basic drives. Rather than searching for conditiored appetitive drives, it might be 
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more profitable to turn our attention to how eating and drinking can come under 
the control of learning mechanisms, particularly classical and operant conditioning. 
Pursuit of this enterprise brings us in contact with areas of research that have for 
too long remained apart from the traditional conditioned drive literature. 


Classical Conditioning of Regulatory Eating and Drinking 


Substantial evidence exists that certain aspects of regulatory eating and drinking (i.e. 
eating and drinking related to metabolic needs) can be conditioned within a format 
suggestive of the classical conditioning paradigm. For example, Fitzsimons & Le 
Magnen (53) found that in a stable feeding situation the drinking patterns of rats 
were closely keyed to the future metabolic water needs generated by the size or the 
constitution of the meal. Because of the time course of the adjustments in water 
intake, they maintained that this “food-associated” drinking was not triggered by 
appropriate systemic regulatory stimuli (“primary thirst”) but rather was the result 
largely of the conditioning of drinking to some features of the ingested food (e.g. 
oropharyngeal stimuli). However Deaux, Sato & Kakolewski (39) sacrificed rats a 
few minutes after completing an accustomed 1.5 g meal of dry food (at the very 
instant that the animals commenced drinking) and found that their plasma osmolal- 
ity was significantly higher than that of unfed controls. They suggested that this 
systemic cue, rather than conditioning, could account for the initiation of food- 
associated drinking. Another possibility is that the increased plasma osmolality was 
itself a conditioned rather than an unconditioned response to the ingested food, 
developed in accordance with the classical conditioning model sketched below. If 
so, the drinking in question may be regarded as conditioned despite the presence 
of a relevant systemic cue. 

The conditioning of satiety [the regulation of meal size in accordance with its 
caloric content before this information can be systemically registered (20)] has also 
been reported in rats (20, 81). As pointed out by Booth (20), many of the earlier 
studies offered as evidence for conditioned satiety (e.g. 81) are susceptible to an 
interpretation in terms of conditioned taste aversions (105). His own studies, which 
avoided this difficulty, showed that the size of a feeding bout could come under the 
control of oral cues paired with different caloric densities. Moreover, the differentia- 
tion remained when two diets were offered alternately, equal in caloric value but 
tagged with olfactory or taste cues which formerly were associated with diets of high 
and low caloric density. 

A more general effect of oropharyngeal factors on satiety has been reported by 
Jordan (76), who in intragastric feeding experiments with humans found that sub- 
jects given intragastic food loads did not compensate fully for them by the amount 
that they consumed orally, nor were they as satisfied by the intragastric intake as 
by normal ingestion (cf. 137). “Apparently there is a separate oral aspect of satiety 
that remains unsatisfied in intragastric feeding” (76, p. 505). 

The precise nature of the processes involved in the conditioning of regulatory 
eating and drinking remains to be specified. Most accounts do little more than state 
that appropriate consummatory and satiety behaviors become anticipatory, or that 
they come under the control of oropharyngeal stimuli (20, 53, 91). If learned 
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regulatory eating and drinking are based on classical conditioning, two explicit 
models suggest themselves. In the first, taking fooJ-associated drinking as an exam- 
ple, the role of US is served by the systemic cues that regulate drinking (cf. 81), 
drinking itself being the UR. Then, in accordance with the classical conditioning 
paradigm, drinking moves forward as an anticipa:ory (conditioned) response to the 
CS of food and its ingestion. The second model views the systemic regulatory “cues” 
as URs to the metabolic conditions which produce them, just as hypoglycemia is 
a response to insulin. Through classical conditioning these systemic regulatory 
processes (or some fraction of them) become anticipatory (much in the way that 
hypoglycemia can be classically conditioned), and in the present case presumably 
elicit drinking. This, apparently, was the model =mployed by Balagura (9) in his 
attempt to demonstrate conditioned hunger. 

The notion that the conditioning of regulatory eating and drinking resides in the 
classical conditioning of appropriate systemic regulatory processes [either directly 
or indirectly (cf. 38)], rather than in the classical conditioning of consummatory 
behavior, enjoys certain advantages. For one thing, conditioned satiety is perhaps 
more easily handled by this formulation. Rather "han conceiving the CR as being 
the cessation of eating, as one must, given the first model, the second view requires 
only that the appropriate post-ingestive systemic: processes, perhaps including in 
part hyperglycemia (cf. 98), become conditioned in greater force to diets of higher 
caloric intensity. For another, the model makes ccntact with the conditioned taste- 
aversion literature, which, as pointed out below is not totally unrelated to the 
problem at hand. That certain neurohumoral and even “non-neural” physiological 
processes are capable of modification via classical conditioning seems likely (2, 72, 
148-150; cf. 29), although, as Gantt (56) points out, there are distinct limits to the 
conditionability of physiological processes. 

There are two other points that should be mentioned before leaving this topic. In 
the natural course of eating and drinking the cues zhat serve as conditioned stimuli 
for the control of regulatory eating and drinking are modality relevant, as they have 
been in related laboratory experiments (20, 81). Itis interesting in this connection 
that the lastest attempt at hunger conditioning using the traditional conditioning 
paradigm (94) produced some evidence for conditioning only when gustatory cues 
formed part of the CS complex. In view of the critical role played by modality 
relevance in the establishment of conditioned aversions (43, 58, 63, 121, 143), it 
perhaps is little wonder that the traditional conditioned hunger and thirst studies, 
most of which employed arbitrary CSs, produced such meager results. The second 
observation is that classical conditioning of reguletory eating and drinking extin- 
guishes fairly rapidly (20, 81), which is what one would expect from the functional 
requirements of these acquired behaviors. If conditioned satiety, conditioned drink- 
ing, and other such processes are to serve usefully im food and water regulation they 
must adapt adroitly to changes in diet and in other relevant conditions. 


Operant Conditioning of Nonregulatory Eatrig and Drinking 


That eating and drinking can come under the control of operant conditioning 
procedures hardly needs documentation. And, althcugh subject to special contraints 
-(14), consummatory behavior established by apprcpriate contingencies can be ex- 
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tremely resistant to extinction, as was perhaps illustrated by the conditioned drink- 
ing studies (e.g. 117, 139). The excessive eating by obese people doubtlessly is 
sometimes the result of operant conditioning (but see 99), which in turn has been 
used successfully to reduce eating to more acceptable levels (e.g. 100). Although 
interpretations differ (27), there can be little doubt that the profound depression of 
eating behavior often observed in anorexia nervosa, which has a mortality rate as 
high as 15% (15), is frequently shaped and maintained by operant contingencies. 
As in the case of obesity, behavior modification based on operant conditioning 
principles has been successfully used in restoring more acceptable eating behavior, 
often with striking weight gains (11, 15, 112). 

A tempting, though clearly premature, generalization is that classically condi- 
tioned eating and drinking behaviors are likely to be regulatory whereas operantly 
conditioned consummatory behavior tends to be nonregulatory. By anticipating the 
post-absorptive effects of ingested food and water, classical conditioning enables the 
organism more quickly and more accurately to adjust its consummatory behavior 
to its metabolic needs (20, 22, 53, 81, 91). 

The potential for operantly conditioned consummatory behavior to serve non- 
regulatory ends resides in the fact that such acquired behavior can rather easily 
come under the control of stimulus conditions far removed from metabolic needs 
and can be reinforced by consequences which bear little relationship to the satisfac- 
tion of metabolic requirements, such as parental approval, anxiety reduction, and 
the like (15, 100). The despair expressed from time to time by concerned nutrition- 
ists with regard to the American diet, both with respect to quality and quantity, is 
testimony of how far consummatory behavior can drift from regulatory ends. 

Extending the classical conditioning model a bit further, perhaps conditioned 
taste aversions based on vitamin-deficient, or poisoned, diets may also be viewed as 
classically conditioned regulatory eating and drinking. According to this view, the 
illness that results from ingestion of vitamin-deficient or contaminated food is 
characterized by certain physiological processes which serve as the URs. Some 
portion of the latter become conditioned to the oropharyngeal cues produced by the 
defective diet, becoming anticipatory and thereby inhibiting consummatory behav- 
ior (cf. 106). In a recent study bearing directly on this point, Zahorik (155) worked 
with taste aversions induced by thiamine deficiency, one of the most prominent 
symptoms of which is bradycardia. She found that rats, either in a thiamine-deficient 
or a recovered state, drank more of a “control” solution than of one that had been 
paired with thiamine deprivation, and drank still more of a third solution that had 
been paired with thiamine injections [known to produce rapid temporary recovery 
from some deficiency symptoms (155, 156)]. However, the important point is that 
the animals’ heart rates reflected their drinking behavior, being slowest in the second 
condition and most rapid in the third, as if the bradycardia characteristic of thia- 
mine deficiency became classically conditioned to the solution with which it had 
been associated and the increased heart rate which attended recovery from thiamine 
deficiency became conditioned to the solution with which it had been paired. Al- 
though these results offer some support for the classical conditioning model outlined 
above, they are in clear need of verification and extension. 

' Perhaps a sharper distinction between classical and operant conditioning has been 
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drawn than can be justified by recent empirical and theoretical developments (12, 
16, 18, 93, 132; but see 40). Nevertheless, the distinction may still be of heuristic 
value in sorting out the learning mechanisms ~vhich contribute to the control of 
regulatory and nonregulatory consummatory behaviors. 


Conditioned Aversive States 


Although the relevant literature continues to rzount, because of space limitations 
this topic will not be covered. Apart from the wua] journal sources, detailed treat- 
ment of various aspects of the topic can be fcand in a number of recent edited 
volumes (13, 28, 62, 92). 


CONDITIONED REINFORCEMENT AND THE INFORMATION 
HYPOTHESIS 


The quest for the essential nature of positive co-ditioned, or secondary, reinforce- 
ment is still very much alive. According to the classical view of conditioned rein- 
forcement, appropriately pairing a previously neutral stimulus with primary 
reinforcement is a sufficient condition for establishing a secondary reinforcer, which 
thereafter functions, at least for a time, in muck the same way as a primary rein- 
forcer. The pros and cons of this interpretatiom have already been dealt with in 
considerable detail and need not be recounted he- (69, 85, 142). A strong challenge 
is offered to the traditional view by what may oe referred to as the information 
hypothesis; this holds that neutral stimuli acquir= reinforcing properties not simply 
because they have been paired with primary reinforcement but rather because they 
provide relevant information about such events. Thus, for example, a stimulus may 
consistently precede the presentation of food but if its information is redundant, in 
the sense that the same information is conveyed by an immediately antecedent 
stimulus, it may develop little in the way of conditioned reinforcement properties 
(44, 45). The information hypothesis of conditicned reinforcement has gained in- 
creasing support in recent years, though by no r-eans to the extent of eradicating 
competing interpretations (41, 103). A nakedly cognitive theory of conditioned 
reinforcement has been advanced by Longstreth (£5), who went beyond the informa- 
tion hypothesis in arguing that secondary reinforcers are really only means for 
securing desired rewards in the future. “People cD not work for money because it 
has been paired with rewards in the past. They ork for it only if they believe it 
will bring rewards in the future” (p. 76). 

Despite the crossfire of conflicting opinions regarding the conceptual status of 
conditioned reinforcement, cultivation of its emfirical basis continues. Thus, for 
example, token reinforcement in kindergarten Zhildren has been demonstrated 
within a setting that eliminates an interpretation in terms of response facilitation 
exerted by the presence of the tokens themselves. Children took longer to extinguish 
pressing a button that had delivered a token pre-iously paired with trinkets than 
a button that earlier produced a token with no such prior history (128). Although 
this study was apparently motivated by the traditional view of conditioned reinforce- 
ment, it seems vulnerable to the cognitive interpr2tation suggested by Longstreth 
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(85). Perhaps the children pressed the first button longer simply because they 
believed they might subsequently be allowed to exchange any tokens they earned 
for trinkets. 

Be that as it may, the cognitive hypothesis, which emphasizes the subject’s ability 
to reason, would have to be pushed a bit to account for the establishment in 
10-month-old infants of a spatial discrimination solely on the basis of conditioned 
reinforcement (124). Infants were first exposed to a short training period in which 
each time they touched an illuminated target on a feeding table a tone was sounded, 
followed shortly by the delivery of a Froot Loop reinforcement; a second, distinc- 
tively different, tone was also presented but not in association with reinforcement. 
Two target stimuli were then presented, one of which when touched produced the 
“positive” tone (but not Froot Loops), the other, the “negative” tone. The infants 
showed a clear preference for touching the target that produced the positive tone. 
As in other studies using the same general paradigm, there may be a question as 
to whether the contro] exhibited by the positive tone should be attributed to a 
reinforcing or to an eliciting function of that stimulus. This is one of a number of 
troublesome methodological points that must be kept in mind when designing 
studies aimed at demonstrating conditioned reinforcement effects (85, 142). 

Turning to developments in methodology, Albert & Mah (1) recently reported a 
novel method of illustrating one-trial learning based on conditioned reinforcement. 
Water-deprived rats were first habituated to a bare black box which contained a 
pyramid-shaped nook. An empty drinking spout (of the same kind used in the home 
cages) was then inserted into the nook and the animals allowed to explore the spout 
for a maximum of 15 seconds. During a 5-minute test session given 48 hours later 
with the water spout removed, these animals showed much more exploration within 
the area of the nook than control subjects. This very simple experimental task was 
also shown to be sensitive to motivational variables, in that the water deprivation 
condition at the time of testing proved to be quite important in determining explora- 
tion of the nook area, while deprivation conditions during training, and the amount 
of time spent by the animals exploring the empty spout during training, were less 
effective variables. Because it produces clear evidence of learning with an exposure 
“trial” of only a few seconds, this paradigm might prove useful in areas other than 
conditioned reinforcement, e.g. memory consolidation. Somewhere at the opposite 
end of the complexity continuum, schedule interactions within the token-reinforce- 
ment paradigm have been investigated in rats, with ordinary glass marbles serving 
as tokens (136). [See (69) for a useful review of the methods currently in use for 
studying conditioned reinforcement.] 


Conditioned Reinforcement and Observing Behavior 


As already indicated, the conflict between the traditional analysis of conditioned 
reinforcement and the information hypothesis can be reduced to the question of 
whether stimuli achieve reinforcing properties because in the past they occurred 
contiguously with positive reward or because they provide information regarding 
subsequent reinforcing events. [Some writers, it should be pointed out, apparently 
allow for both pairing and information in determining the strength of a conditioned ` 
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reinforcer (e.g. 145).] To its credit, the traditional interpretation is reasonably 
objective in specifying the conditions under which a previously neutral stimulus 
achieves the status of a conditioned reinforcer. On the other hand, in spite of 
demonstrated applications of formal informaticn theory to psychological research 
(e.g. 4, 60), the information hypothesis of condi-ioned reinforcement has developed 
largely out of an intuitive grasp of informational concepts, although there are signs 
that this situation is changing (16, 147). There & also a definite movement afoot to 
extend the scope of the information hypothesis from conditioned reinforcement to 
a much wider range of phenomena, including >artial reinforcement effects (145), 
classical and operant conditioning, and their interactions (16). 

A major impetus for the application of information concepts to conditioned 
reinforcement has come from investigations of otserving behavior. These are studies 
in which the presentation of discriminative stimuli (S+, S-, or both) are made 
contingent upon the performance of an “observing” response. In a pioneer study 
verified many times, Wyckoff (153) found that pigeons would emit substantial 
numbers of observing responses which served to produce the discriminative stimuli 
of a multiple schedule (which otherwise became a mixed schedule). He attempted 
to accommodate this result within the traditional interpretation of secondary rein- 
forcement by assuming that the strength of a positive conditioned reinforcer is 
related to percentage of reinforcement by an increasing exponential function (154). 
Since that time, however, there have been numer us attacks on Wyckoff’s analysis, 
all suggesting that the information provided by the stimuli produced by observing 
responses (or the attending uncertainty reduction, constitute the source of reinforce- 
ment for observing behavior (e.g. 37, 83, 146; cf. 68). 

One issue that has recently been brought into focus by the rival viewpoints is the 
degree to which S— can maintain observing behavior. An unmoditied information 
hypothesis would have to acknowledge that if S~ produces as much information (i.e. 
as much uncertainty reduction) as $+, it should sustain observing behavior to the 
same degree as the latter stimulus. Quite the ccntrary deduction flows from the 
traditional interpretation of conditioned reinforcement, in which S- qualifies as a 
secondary aversive stimulus, one which very like y would tend to punish immedi- 
ately preceding responses. The empirical data regarding this issue still are not clear. 
On the one hand, there have been reports that observing behavior can be maintained 
when its only consequence is to produce S- (82, 110). These results have been 
criticized, however (41), even by critics friendly to zhe information hypothesis (145). 
Moreover, Dinsmoor, Browne & Lawrence (41) reported just the opposite result, 
that when only S- was produced by observing responses, the latter declined to 
near-zero levels. 

A less direct attack on the same problem takes advantage of the fact that under 
appropriate circumstances the average amount of information available in the dis- 
criminative stimuli (S+ or S-) is related to the probability of S+ (and reward) by 
an inverted U function, symmetrical around 0.50. Thus if observing responses are 
reinforced by the quantity of information they prcduce, rather than by the nature 
of the information (“good news” vs “bad news”), observing response rate ought to 
produce a symmetrical, inverted U function when probability of S+ is appropriately 
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manipulated. This seems not to be the case. Wilton & Clements (147) manipulated 
probability of S+ (and primary reward) from 20 to 80% and found a higher rate 
of observing responses in the latter condition. They suggest that the asymmetry 
arises because the same amount of information has different reinforcing effects on 
the observing response depending upon whether it signals reward or nonreward. As 
the authors point out, an asymmetrical function relating observing response rate to 
probability of S+ (and reward) is presumptive evidence that more than the informa- 
tional content of the discriminative stimulus determines its ability to reinforce 
observing behavior. But by the same token, such a function does not necessarily 
mean that S~ has zero positive reinforcing ability. More than likely, we are dealing 
with a common situation in behavioral research, namely, any one of a variety of 
results can be obtained depending upon the particular parameter values chosen. In 
the present case it is likely that the reinforcement schedules associated with S+ and 
S-, scheduling of the observing response, the duration of discriminative stimuli 
produced by each reinforced observing response, the level of deprivation, and possi- 
bly other variables, determine the degree to which S- contributes to the maintenance 
of observing behavior. Trying to establish that S- never reinforces observing re- 
sponses will probably prove fruitless. Good news Ze probably more reinforcing than 
bad news, but if the information provided by bad news were entirely without positive 
reinforcing effect, weather forecasters, at least in the part of the country in which 
I reside, would long since have gone out of business. 


Preference for Signaled Reinforcement 


There are a large number of studies, closely related to the observing response 
research in all but terminology, which clearly show that animals and humans very 
generally prefer signaled to unsignaled reinforcement. To cite several of the most 
recent of these experiments, rats prefer signaled (positive) electrical brain stimula- 
tion to unsignaled stimulation (30, 31), and humans, bathed in hot, humid, air from 
the chin up, prefer signaled to unsignaled 15-second cooling air puffs delivered to 
the face (54). Turning to aversive stimuli, rats prefer signaled to unsignaled shock 
whether or not the shock is escapable (5, 6), and show the same kind of preference 
in a shock avoidance task (8). The greater aversiveness of unsignaled over signaled 
shock has also been verified using a response rate measure (87). The magnitude of 
the difference in the aversiveness of the two types of shock is indicated by the fact 
that in one study rats chose signaled shock 4 to 9 times longer in duration than 
unsignaled shock of the same intensity, and with duration held constant they chose 
signaled shock 2 to 3 times more intense than unsignaled shock (7). Although the 
consensus is strong, preference for signaled reinforcement is not a unanimous 
finding. Hershiser & Trapold (70) reported that their rats consistently preferred 
unsignaled to signaled reinforcement, which took the form of sucrose solution 
injected directly into the mouth by means of a permanent oral fistula. Apparently, 
however, such exceptions are rare. 

Taken as a whole, these experiments, particularly those dealing with aversive 
stimulation, seem to offer striking support for the information hypothesis. In many 
of these studies the subject had two classes of responses available, one of which led 
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to signaled reinforcement and the other to a physically equivalent reinforcement but 
minus the signal. The indicated conclusion, frcm the point of view of the informa- 
tion hypothesis, is that the first response class vas chosen over the second because 
of the extra reinforcement provided by the infcrmation residing in the signal. The 
information hypothesis can also point to a number of situations in which a stimulus 
signaling an aversive event was eagerly sought out, perhaps producing at last the 
needed evidence that bad news can be stronglr reinforcing. 

Unfortunately, the situation is not that simple. In the first place, the signal, apart 
from any reinforcing value it might possess, appears to influence the subjective value 
of the primary reinforcement itself. This is suggested by the rat’s preference for 
signaled shock even at the cost of far longer o7 much more intense shocks (7); it 
seems unlikely that the marked disparity in the physical values of the signaled and 
unsignaled shocks could be completely compen:zated for by the reinforcement sup- 
plied by the information in the signal, i.e. by uncertainty reduction. The same point 
is illustrated in the case of positive reinforcemeat by the finding that an otherwise 
nonreinforcing electrical brain stimulation can te rendered reinforcing by having it 
preceded by a brief signal (31). One could pernaps complement the information 
hypothesis with a mechanism such as Perkin:’ (101) “preparatory responses,” 
thereby accounting for both the reinforcement nesiding in informative stimuli and 
the effects that such stimuli sometimes have on the subjective intensity of subsequent 
primary reinforcing events. It is questionable, though, whether given a satisfactory 
explanation of the second class of phenomena, any need would exist for a separate 
accounting of the first. Further eroding the apparent support for the information 
hypothesis, some investigators have argued that the important informative stimulus 
in signaled aversive paradigms is not the signal keralding the advent of shock, but 
rather its absence which, serving as a cue for nonshock periods, constitutes a 
“safety” signal (7; cf. 70). So there is a questioa about how firmly these studies 
support a positive reinforcing role for bad news 

Rescorla (103) has recently taken aim at the information hypothesis as that 
conceptual system has been applied to Pavlovizn conditioning. He pointed to a 
number of developments, all seeming “to suggest that by itself contiguity between 
a CS and US is insufficient for Pavlovian conditicning. Rather, for a CS to become 
conditioned, it must in some sense provide ‘information’ about the coming of the 
US” (p. 2). Rescorla went on to analyze the implications of these suggestions, 
ultimately coming to the conclusion “that althouzh it may sometimes be useful to 
describe conditioning in informational terms, in tre end that intuition is incomplete 
and incorrect” (p. 43). As a substitute for informational concepts, he presented a 
quantitative theory that was developed jointly wth Wagner (e.g. 104). Although 
some may fee] that the theory bears a distressing resemblance to Hull’s classical 
learning theory and to first-generation mathemat:cal models, it does a suprisingly 
good job of accounting for at least some of the results (e.g. blocking) that appear 
to call for an informational analysis of Pavlovian conditioning. 

If the informational analysis of conditioned reirforcement, Pavlovian condition- 
ing, and related behavioral processes (16) is to resis . assimilation by more traditional 
behavior theory, it will have to show that its scope of application is at least as wide 
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as that of the latter. To accomplish this expansion it probably will have to assign 
reinforcement value to stimuli, not only on the basis of the amount of information 
transmitted, but also in terms of the nature of the event addressed by the informa- 
tion. Surely not all information (defined in technical terms) is equally valuable. The 
cost of achieving the information must also be considered. Not many people would 
trouble to learn the time of day if they had to purchase a Tiffany wristwatch each 
time to do so. It is also important to learn whether the reinforcement function 
assigned to uncertainty reduction can, in the interests of parsimony and objectivity, 
be transferred to more peripheral processes, such as preparatory responses, tension 
reduction, and the like. 

Several years ago this reviewer (37) reported that when monkeys were either 
extinguished or reversed on a simple discrimination problem they increased enor- 
mously their rate of observing responses which, because both S+ and S- followed 
each observing response, produced only redundant information. In informational 
terms it looked as though the nonreinforcement that suddenly accompanied the 
onset of extinction or reversal induced a state of uncertainty in the animals which 
they attempted to reduce by a search for relevant information. (Frustration seemed 
unable to account for the increase in observing behavior.) This is the kind of result 
that still poses problems for traditional reinforcement views, and the kind that must 
be handled in considerable detail by the informational analysis if it is to prove a 
viable alternative. 


THE WORK ETHIC IN ANIMALS AND CHILDREN 


A number of studies have reported that rats (both normal and with septal or 
ventromedial hypothalamic lesions), pigeons, and children (Caucasian and Ameri- 
can Indian) will perform an instrumental response to obtain precisely the same 
reward that is available to them with no work requirement. Rats will run down an 
alley tripping over hundreds of food pellets to obtain a single, identical, pellet in the 
goal box (134), press a lever on various schedules to obtain pellets of food which 
can be had without work in the nearby vicinity (32, 75, 96, 125, 126), and pigeons 
will peck a key, even on intermittent schedules of reinforcement to secure exactly 
the same food that is freely available in a nearby food cup (96, 97). Given the option 
of receiving marbles merely by waiting an equivalent amount of time for their 
delivery, children tend to prefer to press a lever (on a fixed-ratio 10 schedule) to 
obtain the same marbles (125, 127). 

As noted by Taylor (135), there are two related but separable issues involved in 
these work vs “freeloading” studies. With reference to animal subjects, the first is 
whether, and to what extent, animals will work for food when identical food is freely 
available; the second, which is really a special case of the first, is whether animals 
prefer to work for food rather than to freeload. If verified, this stronger version 
places animals, and by implication humans, in a far more sanguine light with regard 
to their motivational systems than heretofore suspected by many. 

That animals and children will indeed spend part of their time working for the 
same rewards that are available with no work requirement is unquestioned, even by 
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Taylor (135). And something is known about the conditions that modulate this 
phenomenon: (a) The instrumental behavior is mot maintained simply by “intrinsic” 
reinforcement, for if the instrumental response fails to produce the reward object, 
its rate drops off precipitously in pigeons and rats (96) and children bar press less 
(127). (6) Increasing the work requirement (32 but see 97, 126) or increasing the 
rate of delivery of free food (125) increases frzeloading. (c) On the other hand, 
higher levels of training on the instrumental response tend to reduce the amount of 
subsequent freeloading (75, 134). (d) Prior operant training is not required, how- 
ever, for pigeons living in an experimental chamber with food continuously available 
will acquire and maintain (for at least 20 days) a key-pecking or lever-pressing 
response which produces identical food (96). (e) Finally, in the single study that 
employed water as reward, rats showed very little disposition to work for water 
when free water was available (135). 

The evidence that animals prefer work over fr=eloading, in the sense that during 
a given experimental period they consume more earned than free food, is less firm 
(135). Yet, striking instances have been reported 32, 75, 125). An analogous prefer- 
ence in children (for marbles) has been observec by Singh (125, 127). Apart from 
their potential significance for certain touchy social issues, these results are interest- 
ing because they seem to be counter-intuitive ard counter-theoretical, at least for 
many behavioristic theories (96, 125). 

Following Kavanau (77), Singh (125, 127) kas proposed that controlling the 
environment, that is, “manipulating, and altering relationships with any susceptible 
environmental features” (77, p. 1623) is an impor-ant factor in behavior determina- 
tion. Animals enjoy a substantial degree of contrel over their natural environment, 
reflected in such activities as obtaining food, builcing and maintaining nesting sites, 
avoiding and escaping predators, and so on. Exer-ing control over the environment 
is so essential to the survival of animals that it is not implausible that exercising such 
control is itself reinforcing. Consequently, if in a laboratory situation an opportunity 
for environmental control presents itself, an animal will ordinarily take advantage 
of it, as in the work vs freeloading studies. One imolication of this hypothesis is that 
if means of controlling the environment were prcvided in addition to working for 
food [such as manipulating the illumination level (77)], animals will enage in these 
activities at the expense of working for food and hence show a greater propensity 
for freeloading. Apparently, no such study has ben reported. 

Another possibility is that the two sources of reward available in freeloading 
studies, although physically identical, are not perceived as identical by the animals. 
The very same wine often tastes better when drurk from a stem glass than from a 
paper cup. Perhaps there is something about the temporal sequencing (or other 
properties) of earned food that differentiates it from free food. Assuming that 
animals prefer some degree of variability in their ciets, one would expect that they 
would occasionally work for food even though free food was available. However, 
a preference for earned food cannot be explained on this basis alone. It might be 
interesting, nevertheless, to tag the earned and free food with different odors or 
flavors, which if the present hypothesis has substance, ought to decrease freeloading. 

Obviously, a great deal remains to be learned about “contrafreeloading” (135), 
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its persistence (most demonstrations have involved only a small number off ses- 
sions), its relationship to exploratory and manipulatory behavior (46), to autoshap- 
ing, or more generally behavior maintained by noncontingent reinforcement (25, 55, 
111, 132, 144), and to the phenomenon of “self-reinforcement” which, recently 
demonstrated in pigeons (90), is a self-management technique currently employed 
in behavior modification (89). 


AN OPPONENT-PROCESS THEORY OF ACQUIRED 
MOTIVATION 


Anyone consulting the reviews of derived motives in this series for enlightenment 
regarding the possible basis of human acquired motivation must have come away 
sorely disappointed. The vast majority of studies cited in these papers relate to 
animal, and in particular rat, research. The reasons for this emphasis are not hard 
to find. Most of the reasonably rigorous conceptualizations of acquired motivation 
have, after all, derived from traditional learning theory, which was itself heavily 
rooted in rat research. Moreover, such motivationally relevant concepts as condi- 
tioned appetitive and aversive drives, incentive motivation, and conditioned rein- 
forcement did not develop because of a basic concern with motivation itself but 
rather as a response to the needs of an expanding theory of learning. It is little 
wonder, then, that these concepts have never been forged into a convincing theory 
of acquired motivation. 

Quite recently, however, Solomon & Corbit (129) have proposed an opponent- 
process theory of motivation which, being stimulated by problems of motivation 
rather than of learning, makes contact with virtually the full range of acquired 
motivational phenomena, from the “joy” experienced by dogs when an accustomed 
strong aversive event is terminated, to the abstinence agony that torments addicts 
upon drug withdrawal. Because of its potential theoretical significance and its 
relevance, the model and its applications will be described here in a little detail. 

An important feature of the model is the assumption of hedonic or motivational 
homeostasis, hardly a novel innovation. (In the current version of the model hedonic 
value, emotion, affect, motivation, and related concepts are not distinguished.) What 
is novel is the nature of the mechanism that is postulated to accomplish hedonic 
homeostasis. In effect, much of what is considered acquired motivation 1s created 
from the failure, or at least the over-zealousness, of the homeostatic mechanism. 
Their notion, in brief, is that whenever a significant departure from hedonic equilib- 
rium occurs due to the presentation of an affect-producing stimulus, a negative 
feedback system is activated which generates “an affective process which is, in some 
abstract sense, the opposite of that precipitated by the stimulus input.” Thus we 
have a primary affective or motivational process, also referred to as affective state 
A, which is elicited by an appropriate stimulus situation; elicitation of state A leads 
immediately to a secondary affective process which, because it is opposite in affective 
value, is termed an opponent process. The affective state identified with the oppo- 
nent process may be referred to as state B Although the opponent process is 
assumed to be “sluggish in its latency, recruitment, and decay,” with repeated 
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elicitation of state A it is assumed to grow in intensity, occur with shorter latency, 
and decay to affective zero less rapidly. Given these assumptions, it is not difficult 
to see that with proper parameter values the opponent process could overshoot its 
target of establishing hedonic equilibrium and overwhelm the primary affective 
process, whether it be positive or negative. 

To cite an illustration given by Solomon & Corbit, dogs initially subjected to 
strong noncontingent shocks behave in such a way as to suggest a state of terror 
(extreme negative affect) while shock is presen:, reverting to a state of stealth, 
submission, and caution upon termination of shock. After the dogs have had consid- 
erable experience with shock, a dramatic change takes place in their behavior. Upon 
application of shock, terror is replaced by a far less aversive state (unhappiness, 
anxiety), and when shock is terminated, a distinctly positive affect state seems to 
result, the animals appearing euphoric, active, social and happy. The opponent- 
process model manages to accommodate these striking changes in behavior, and in 
inferred affective states, by its assumption that rep2ated application of shock (result- 
ing in repeated elicitations of state A) serve to strengthen greatly the opponent 
process (state B). As a result, the negative affect generated by the shock stimulus 
is greatly attenuated and upon termination of shock, because state A is thereby 
eliminated and only a greatly exaggerated state B remains, the animals are left in 
a state of euphoria. 

This same type of interaction between a primary affective process and an immedi- 
ately reacting secondary opponent process can ze applied to a wide variety of 
phenomena in the general area of acquired mo-ivation. Parachutists frequently 
experience considerable fear on initial jumps and st the termination of a jump may 
appear “stunned” and “stony-faced” (129). After much experience, however, during 
which the opponent process gains in strength, the terror associated with the jump 
gives way to tense expectancy, and at the termination of the jump, which leaves only 
a heightened state B to express itself, feelings of exhilaration and jubilation result, 
often lasting for considerable lengths of time. [By way of comparison, Epstein and 
Fenz’s anxiety inhibition principle (cf. 48, 51) can account for the reduction of fear 
in experienced parachutists but not for the develcpment of positive affect.] 

Cigarette and drug addiction are handled in much the same way, only in these 
cases state A has positive affect and state B is an unusually long-lasting negative 
affect state. Since another assumption of the model is that state B can be eliminated 
or reduced by reinstitution of state A, addiction may be viewed as behavior in the 
service of escaping an aversive state B by inducinz (via cigarettes, drugs, etc.) its 
antidote, state A. Unfortunately, however, grossly unlike antidotes, the reinstitution 
of state A only means that state B will become that much stronger. Thus the addict 
is slowly crushed under an opponent process which becomes more massive each time 
it is temporarily dislodged. 

There are other features of the model that are wcrthy of note. Although changes 
in the strength of the opponent process forms the core of acquired motivation within 
the model, there is a second way in which learning enters. Both state A and state 
B are assumed to be conditionable by Pavlovian conditioning techniques, though 
quite different consequences are associated with the conditioning of the two states. 
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The major difference is that a stimulus which, through conditioning, is capable of 
eliciting state A will have a “biphasic” effect. It will first elicit state A and, because 
of the unconditioned coupling of the primary and opponent affective processes, state 
B will also make an appearance. This implies that when state A is negative, condi- 
tioned elicitation of state A will result in a negative hedonic state followed by a state 
of positive affect, e.g. “anxiety” followed by “relief.” Conditioning of B states leads 
to no such reversal of affect. 

Because the model applies equally to positive and negative affect states, the 
possibility exists for developing an exaggerated opponent process to a negative A 
state. In such a case, an individual might deliberately inflict an aversive A state upon 
himself in order to achieve the associated positive B state. The potential of this 
implication for interpreting behaviors ranging from acquired tastes for initially 
aversive substances to outright masochism should be apparent. 

Although it is far too early to judge the theoretical and pragmatic fruitfulness of 
the opponent-process model, it seems to incorporate the kinds of mechanisms that 
can be articulated into a viable structure of acquired motivation, if not for the entire 
class of such phenomena, at least for an important subset. Whether addictive behav- 
ior can be meaningfully handled by the theory is perhaps open to question. Solomon 
& Corbit (129) have some interesting things to say about cigarette addiction and its 
treatment, an area greatly in need of fresh insights (10, 71). On the other hand it 
may be stretching things a bit to apply the opponent-process model to drug addic- 
tion. A major assumption of the theory is that the opponent process (state B) is 
weakened by disuse. Consequently, if a heroin addict could by some means be 
prevented from taking heroin for an extended period of time, his craving for the drug 
(state B) should ultimately dissipate, i.e. he should no longer be an addict. This 
seems not to be the case. 

Dole (42), a pioneer of methadone maintenance treatment, points out that addicts 
who have been on methadone maintanance for as long as three or four years, during 
which time their craving for heroin is apparently extinguished, almost always expe- 
rienced “heroin hunger” upon withdrawal of methadone. “They felt that they were 
so far away from drug addiction and so far beyond wanting heroin anymore that 
they would have no need for methadone. For them the return of a heroin hunger 
was terrifying” (p. 24). Dole (42) also observes, “Addicts who have been locked up 
for 5 years in jail, totally abstinent, are likely to get re-addicted within the first 
month after they are out. I believe one has to reckon with the fact that heroin hunger 
is probably a symptom of a pharmacological imprint that may last for a man’s life” 
(p. 24). 

Possibly the opponent-process model can accommodate permanent addiction, 
when it occurs, by postulating that extra-potent B states may not be completely 
reversible. However, this assumption [like the closely related “principle of partial 
irreversibility of anxiety” used by Solomon & Wynne (130) in a completely different 
context] is vulnerable to ad hoc applications. Despite this and other potential 
problems, the opponent-process model is likely to figure importantly in future 
theoretical and research developments in acquired motivation. 
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LEARNING, REVOLUTION, AND DERIVED MOTIVATION 


Since derived motivation, by definition, rests on acquired mechanisms, it cannot 
avoid being affected by significant conceptual and empirical developments in learn- 
ing. It will not come as a surprise to anyone who still reads the relevant literature 
that the area of learning is in the midst, or perhaps at the tail end, of a pair of 
revolutions, which, although starting from quite different bases, sometimes share a 
common front. Like all other revolutions, these two have well-developed roots in 
the past and both have caused more heads to roll than was perhaps necessary. 

The major impact of the first of these revolutions has been to reinsert biology and 
evolution into the learning equation. The divergent lines of inquiry that successfully 
mounted this effort are nicely represented in the recent volume Biological Bounda- 
ries of Learning (116), to which may be added Razran’s monograph (102) and a 
spate of recent articles which converge on the topic from a variety of angles (18, 
49, 56, 59, 61, 79, 106, 113, 114, 120, 132). This development finally came to a head 
because we could no longer afford merely to nod to certain facts, among them, (a) 
that when it comes to serving as discriminative stimuli in learning situations not all 
stimuli are equivalent, or even approximately so—the principle of nonequivalence 
of stimuli; (6) that depending upon the response system chosen, the effectiveness 
with which instrumental behavior is acquired can vary enormously—the principle 
of nonequivalence of responses; and (c) that there is a profound difference in the 
readiness with which associations can be formed, particularly over long temporal 
intervals, which seems to depend upon the “modality” of the association [see (120) 
for a detailed review of the evidence relating to these issues]. Another contributing 
factor was a decreasing confidence in reinforcement as an agent which impartially 
and mindlessly strengthens selectively the most recent behaviors and in the assump- 
tion that it holds an exclusive franchise on this function (e.g. 18, 132). 

All of the nonequivalences made sense when looked at from the point of view of 
the biological and behavioral endowments of the organism as handed down by 
evolution and natural! selection; and, of course, all had been recognized for some 
time inside and outside the area of learning. Why they, and certain closely related 
issues, suddenly erupted to alter theoretical conceptions in learning to an extent that 
makes the term “revolution” quite proper is a matter for future reflection to decide. 
But in any event, these changed attitudes are here to stay and are likely to have a 
deep impact on our conceptualizations of acquired motivation. 

The second revolution is the extraordinary proliferation and development of 
cognitive concepts in psychology. If Tolman were alive today he could only revel 
in the fury of the cognitive assault that threatens to capture almost the full range 
of nonreflexive behavior in both humans and animals (12, 18, 65, 67, 73, 78, 84, 85, 
88, 107, 118); not even among the pages of the Journal of the Experimental Analysts 
of Behavior can one any longer find complete refuge (108). While the variety of 
cognitive concepts employed today is richer than in the past, the rigor with which 
these concepts are defined and applied varies enormously (e.g. 18 vs 73), and at times 
one cannot help wondering whether much more is being accomplished than relabel- 
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ing. Nevertheless, this development too promises to alter permanently long-held 
conceptions regarding learning and motivational processes (79). 

One common ground on which the biological-evolutionary and cognitive develop- 
ments join forces is the belief that many cognitive abilities are themselves the 
products of evolution responding to the pressures of natural selection. They are, to 
use a term endangered by overuse, species-specific behaviors. As Lenneberg (82) has 
put it, “every animal species must be assumed to have cognitive specificities,” a 
general thesis developed more recently in considerable detail by Razran (102). 

Against this background of conceptual reorientation—if not revolution—it re- 
quires little foresight to predict striking changes in our formulations of acquired 
motivation. We probably will see less of conditioned drives and conditioned rein- 
forcers established automatically by conventional conditioning techniques and 
thereafter controlling behavior in much the same way as the real things. We will 
see less effort devoted to reducing all motivational states to some common core 
drive, and still less of the tacit assumption that acquired motivation, whatever it 
might be, develops in essentially the same way across species and situations. On the 
other hand, we are likely to witness much more emphasis on the adaptive signifi- 
cance of acquired motivation, particularly within the ecological niche of the organ- 
ism, more attention paid to the special conditions that foster the acquisition of 
derived motives in different species, and greater sensitivity toward the structural and 
cognitive requirements of different systems of learned motives. Out of this web of 
increasing complexity there may one day emerge a theoretically satisfying, yet 
useful, general formulation of acquired motivation. 


Literature Cited 


i. Albert, D. J., Mah, C. J. 1972, An ex- nalled shock over shorter or weaker un- 
amination of conditioned reinforcement signalled shock. J. Exp. Anal. Behav. 
using a one-trial learning procedure. 19:25-32 
Learn. Motiv. 3:369-88 8. Badia, P., Culbertson, S., Lewis, P. 

2. Alvarez-Buyalla, R., Carrasco-Zanini, 1971. The relative aversiveness of sig- 
J. 1960. A conditioned reflex which re- nalled vs unsignalled avoidance. J. Exp. 
produces the hypoglycemic effect of in- Anal. Behav. 16:113-21 
sulin. Acta Physiol. Lat. Am. 9. Balagura, S, 1968. Conditioned gly- 
10:153~58 cemic responses in the control of food 

3. Appley, M. H 1970. Derived motives. intake. J. Comp. Physiol. Psychol. 65: 
Ann. Rev. Psychol. 21:485-518 30-32 

4. Attneave, F. 1959. Applications of In- 10. Bernstein, D. A. 1969. Modification of 
formation Theory to Psychology. New smoking behavior: an evaluative review. 
York: Holt. 120 pp. Psychol. Bull. 71:418-40 

5, Badia, P., Culbertson, S. 1970. Behav- ll. Bianco, F. J. 1972. Rapid treatment of 
ioral effects of signaled versus unsig- two cases of anorexia nervosa. J. Behav. 
naled shock during escape training in Ther. Exp. Psychiat. 3:223-24 
the rat. J. Comp. Physiol. Psychol. 72: 12. Bindra, D. 1972. A unified account of 
216-22 classical conditioning and operant 

6. Badia, P., Culbertson, S. 1972. The rela- training. See Ref. 13, 453-8] 
tive aversiveness of signalled vs unsig- 13. Black, A. H., Prokasy, W. F., Eds. 
nalled escapable and inescapable shock. 1972. Classical Conditioning HI: 
J. Exp. Anal. Behav. 17:463-71 Current Theory and Research. New 

7. Badia, P., Culbertson, S., Harsh, J. York: Appleton-Century-Crofts. 497 


1973. Choice of longer or stronger sig- PP- 


102 


21. 


22. 


23. 


24, 


25. 


26, 
27. 
28. 


29. 


30. 


dk 


32. 


D’AMATO 


Black, A. H., Young, G. A. 1972. Con- 
straints on the operant conditioning of 
drinking. See Ref. 62, 35-50 


. Blinder, B. J., Freeman, D. M. A. 


Stunkard, A. J. 1970. Behavior therapy 
of anorexia nervosa: effectiveness of ac- 
tivity as a reinforcer of weight gain. 
Am. J. Psychiat. 126:1093-98 


. Bloomfield, T. M. 1972. Reinforcement 


schedules: contingency or contiguity? 
See Ref. 62, 165-208 


. Bolles, R. C. 1967. The Theory of Moti- 


vation. New York: Harper & Row. 546 


DD, 
. Bolles, R. C. 1972. Reinforcement, ex- 


pectancy, and learning. Psychol. Rev. 
79:394-409 


. Bolles, R. C., Moot, S. A. 1972. Derived 


motives. Ann. Rev. Psychol. 23:51--72 


. Booth, D. A. 1972. Conditioned satiety 


in the rat. J. Comp. Physiol. Psychol. 
81:457--71 

Booth, D. A., Brookover, T. 1968. Hun- 
ger elicited in the rat by a single injec- 
tion of bovine crystalline insulin. 
Physiol. Behav. 3:439-46 

Booth, D. A., Lovett, D., McSherry, G. 
M. 1972. Postingestive modulation of 
the sweetness preference gradient in the 
rat. J. Comp. Physiol. Psychol. Mo- 
nogr. 18:485-512 

Booth, D A., Pitt, M. E. 1968. The role 
of glucose in insulin-induced feeding 
and drinking. Physiol. Behav. 3:447-53 
Brown, J. S., Farber, I. E. 1968. Second- 
ary motivational systems. Ann. Rev. 
Psychol. 19:99-134 

Brown, P. L., Jenkins, H. M. 1968. 
Auto-shaping of the pigeon’s key peck. 
J. Exp. Anal. Behav. 11:1-8 

Bruch, H. 1969. Hunger and instincts. 
J. Nerv. Ment. Dis. 149:91-114 
Bruch, H. 1971. Anorexia nervosa in 
the male. Psychosom. Med. 33:31-47 
Brush, F. R., Ed. 1971. Aversive Condi- 
tioning and Learning. New York: Aca- 
demic. 626 pp. 

Bykov, K. M. 1957. The Cerebral Cor- 
tex and the Internal Organs, transl. W. 
H Gantt. New York: Chemical Publ. 
448 pp. 

Cantor, M. B., LoLordo, V. M. 1970. 
Rats prefer signaled reinforcing brain 
stimulation to unsignaled ESB. J. 
Comp. Physiol. Psychol. 71:183-91 
Ibid 1972. Reward value of brain stimu- 
lation is inversely related to uncertainty 
about its onset. 79:259-70 

Carder, B., Berkowitz, K. 1970. Rats’ 
preference for earned in comparison 
with free food. Science 167:1273-74 


33. 


34. 


35. 


36. 


37. 


38. 


39. 


40. 


41. 


42. 


43. 


45. 


46. 


Cofer, C. N., Appley, M. H. 1964. Mo- 
tivation: Theory and Research. New 
York: Wiley. 958 pp. 

Cott, A., Wayner, M. J., Millner, J. 
1972. Conditioned drinking: a specific 
or nonspecific response? Physiol 
Behav. 9:219-27 

Cravens, R. W., Renner, K. E. 1970. 
Conditioned appetitive drive states: em- 
pirical evidence and theoretical status. 
Psychol. Bull. 73:212-20 

D’Amato, M. R. 1970. Experimental 
Psychology: Methodology, Psychophy- 
sics and Learning. New York: 
McGraw-Hill. 727 pp. 

D’Amato, M. R., Etkin, M., Fazzaro, J. 
1968. Cue-producing behavior in the 
capuchin monkey during reversal, ex- 
tinction, acquisition, and overtraining. 
J. Exp. Anal. Behav, 11:425-33 
Deaux, E., Kakolewski, J. W. 1970. 
Emotionally induced increases in effec- 
tive osmotic pressure and subsequent 
thirst. Science 169:1226-28 

Deaux, E., Sato, E., Kakolewski, J. W. 
1970. Emergence of systemic cues evok- 
ing food-associated drinking. Physiol. 
Behav. 3:1177-79 

DiCara, L. V., Braun, J. J., Pappas, B. 
A. 1970. Classical conditioning and 
instrumental learning of cardiac and 
gastrointestinal responses following re- 
moval of neocortex in the rat. J. Comp. 
Physiol. Psychol. 73:208-16 
Dinsmoor, J. A., Browne, M. P., Law- 
rence, C. E. 1972. A test of the negative 
discriminative stimulus as a reinforcer 
of observing. J, Exp. Anal, Behav. 18: 
79-85 

Dole, V. P. 1971. Research on metha- 
done maintenance treatment. In Metha- 
done Maintenance, ed. S., Einstein, 
13-27, New York: Marcel Dekker. 249 


pp. 

Domjan, M., Wilson, N. E. 1972. Speci- 
ficity of cue to consequence in aversion 
learning in the rat. Psychon. Sci. 26: 
143-45 


. Egger, M. D., Miller, N. E. 1962. Sec- 


ondary reinforcement in rats as a func- 
tion of information value and reliability 
of the stimulus. J. Exp. Psychol. 64: 
97-104 

Egger, M. D., Miller, N. E. 1963. When 
is a reward reinforcing?: an experimen- 
tal study of the information hypothesis. 
J. Comp. Physiol. Psychol. 56:132- 
37 

Eisenberger, R. 1972. Explanation of 
rewards that do not reduce tissue needs. 
Psychol. Bull, 77:319-39 


47. 


48. 


49, 


51. 


92, 
33; 


54. 


93. 


56. 


57. 


58. 


59. 


60. 


61. 
62. 


63. 


Epstein, A. N., Fitzsimons, J. T., Rolls, 
B. J. 1970. Drinking induced by injec- 
tion of angiotensin into the brain of the 
rat. J. Physiol. London 210:457-74 
Epstein, S. 1967. Toward a unified the- 
ory of anxiety. In Progress in Experi- 
mental Personality Research, ed. B. A. 
Maher, 4:1-89. New York: Academic. 
323 pp. 

Falk, J. L. 1971. The nature and deter- 
minants of adjunctive behavior. 
Physiol. Behav. 6:577--88 


. Faik, J. L. 1971. Determining changes 


in vital functions: ingestion. In Methods 
in Psychobiology, ed. R. D. Myers, 1: 
301-29. New York: Academic. 356 


pp. 
Fenz, W. D., Jones, G. B. 1972. Individ- 
ual differences in physiologic arousal 
and performance in sport parachutists. 
Psychosom. Med. 34:1-8 
Fitzsimons, J. T. 1972. Thirst. Physiol. 
Rev. 52:468-561 
Fitzsimons, J. T., Le Magnen, J. 1969. 
Eating as a regulatory control of drink- 
ing in the rat. J. Comp. Physiol, Psy- 
chol. 67:273-83 
Furedy, J. J., Klajner, F. 1972. Prefer- 
ence for information about an unmodih- 
able but rewarding outcome. J. Exp. 
Psychol. 95:469--71 
Gamzu, E., Schwartz, B. 1973. The 
maintenance of key pecking by stimu- 
lus-contingent and response-independ- 
ent food presentation. J. Exp. Anal. 
Behav. 19:65-72 
Gantt, W. H. 1972. Organ-system re- 
sponsibility, homeostasis and the condi- 
tional reflex. Cond. Reflex 7:1-10 
Gantt, W. H., Katzenelbogen, S., 
Loucks, R. B. 1937. An attempt to con- 
dition adrenalin hyperglycemia. Bull. 
Johns Hopkins Hosp. 60:400-11 
Garcia, J., Koelling, R. A. 1966. Rela- 
tion of cue to consequence in avoidance 
learning. Psychon. Sci. 4:123-24 
Garcia, J., McGowan, B. K., Green, K. 
F. 1972. Biological constraints on con- 
ditioning. See Ref. 13, 3-27 
Garner, W. R. 1962. Uncertainty and 
Structure as Psychological Concepts. 
New York: Wiley. 369 pp. 
Gilbert, R. M. 1972, Variation and se- 
lection of behavior. See Ref. 62, 263-76 
Gilbert, R. M., Millenson, J. R., Eds. 
1972. Reinforcement: Behavioral Anal- 
yses. New York: Academic. 288 pp. 
Green, L., Bouzas, A., Rachlin, H. 
1972. Test of an electric-shock analog to 
Cre amas aversion. Behav. Biol. 
:513- 


64. 


65. 


66. 


67. 


68. 


69. 
70. 


71. 


72, 


13. 


74. 


75. 


76. 


77. 


78. 


79. 


80. 


DERIVED MOTIVES 103 


Greenwood, F. C., Landon, J., Stamp, 
T. C. B. 1966. The plasma sugar, free 
fatty acid, cortisol, and growth hor- 
mone response to insulin. I. In control 
subjects. J. Clin. Invest. 45:429~36 
Grings, W. W. 1973. Cognitive factors 
in electrodermal conditioning. Psychol. 
Bull. 79:200-10 

Grossman, M. I., Stein, I. F. Jr. 1948. 
Vagotomy and the hunger-producing 
action of insulin in man. J. Appl. 
Physiol. 1:263-69 

Hammond, K. R., Summers, D. A. 
1972. Cognitive control. Psychol. Rev. 
79:58-67 

Hendry, D. P., Ed. 1969. Conditioned 
Reinforcement. Homewood, Ill.: Dor- 
sey. 454 pp. 

Ibid. Introduction, 1-33 

Hershiser, D., Trapold, M. A. 1971. 
Preference for unsignaled over signaled 
direct reinforcement in the rat. J 
Comp. Physiol. Psychol. 77:323-28 
Hunt, W. A., Ed. 1970. Learning Mech- 
anisms in Smoking. Chicago: Aldine. 
237 pp. 

Hutton, R. A., Woods, S. C., Makous, 
W. L. 1970. Conditioned hypoglycemia: 
pseudoconditioning controls. J. Comp. 
Physiol. Psychol. 7171:198-201 

Irwin, F. W. 1971. Intentional Behavior 
and Motivation: A Cognitive Theory. 
New York: Lippincott. 201 pp. 
Jacobs, H. L., Sharma, K. N. 1969. 
Taste versus calories: sensory and meta- 
bolic signals in the control of food 
intake. Ann NY Acad. Sci. 157: 
1084-1125 

Jensen, G. D. 1963. Preference for bar 
pressing over “freeloading” as a func- 
tion of number of rewarded presses. J. 
Exp. Psychol. 65:451-54 

Jordan, H. A. 1969. Voluntary intragas- 
tric feeding: oral and gastric contribu- 
tions to food intake and hunger in man. 
J. Comp. Physiol. Psychol. 68:498-506 
Kavanau, J. L. 1967, Behavior of cap- 
tive white-footed mice. Science 155: 
1623-39 

Kimble, G. A. 1971. Cognitive inhibi- 
tion in classical conditioning. In Essays 
in Neobehaviorism: A Memorial Vol- 
ume to Kenneth W. Spence, ed. H. H 
Kendler, J. T. Spence, 69-87. New 
York: Appleton-Century-Crofts. 345 


pp. 
Kimble, G. A. 1973. Scientific psy- 
chology in transition. See Ref. 92, I~ 
19 


Kutscher, C. 1966. Effect of hypertonic 
saline injections and water deprivation 


104 


8]. 


82. 
83. 


84. 


85. 


86. 


87. 


88. 


89. 


90. 


9. 


92. 


93. 


94. 


95. 


96. 


D AMATO 


on drinking, serum osmolality and gut 
water. Physiol. Behav. 1:259-68 
LeMagnen, J. 1969. Peripheral and sys- 
temic actions of food in the caloric regu- 
lation of intake. Ann. NY Acad. Set. 
157:1126-57 

Lenneberg, E. H. 1969. On explaining 
language. Science 164:635-43 
Lieberman, D. A. 1972. Secondary rein- 
forcement and information as determi- 
nants of observing behavior in monkeys 
(Macaca mulatta). Learn. Motiv. 3: 
341-58 

Lockhart, R. A. et al 1973. Classical 
conditioning and the cognitive pro- 
cesses: a symposium. Psychophysiology 
10:74-122 

Longstreth, L. E. 1972. A cognitive in- 
terpretation of secondary reinforce- 
ment. In Nebraska Sympostum on 
Motivation, 1971, ed. J. K. Cole, 33-80. 
Lincoln: Univ. Nebraska Press. 304 pp. 
Lovett, D., Goodchild, P., Booth, D. A. 
1968. Depression of intake of nutrient 
by association of its odor with effects of 
insulin. Psychon. Sci. 11:27--28 
MacDonald, L., Baron, A. 1973. A rate 
measure of the relative aversiveness of 
signalled vs unsignalled shock. J. Exp. 
Anal. Behav, 19:33-38 

Macnamara, J. 1972. Cognitive basis of 
language learning in infants. Psychol. 
Rev. 79:1-13 

Mahoney, M. J. 1972. Research issues 
in self-management. Behav. Ther. 3: 
45-63 

Mahoney, M. J., Bandura, A. 1972. 
Self-reinforcement in pigeons. Learn. 
Motiv. 3:293-303 

McFarland, D. J. 1970. Recent develop- 
ments in the study of feeding and drink- 
ing in animals. J. Psychosom. Res. 
14:229--37 

McGuigan, F. J., Lumsden, D. B., Eds. 
1973. Contemporary Approaches to 
Conditioning and Learning. Washing- 
ton: Winston. 321 pp. 

Miller, N. E. 1969. Learning of visceral 
and glandular responses. Science 163: 
434-45 

Mineka, S. 1972. Conditioned appeti- 
tive drives: some new evidence and a 
new theoretical status. Presented at 
Eastern Psychol. Assoc. Meet., Boston 
Mineka, S., Seligman, M. E. P., He- 
trick, M., Zuelzer, K. 1972. Poisoning 
and conditioned drinking. J. Comp. 
Physiol. Psychol. 79:377-84 
Neuringer, A. J. 1969. Animals respond 
for food in the presence of free food. 
Science 166:399-401 


97. 


98. 


99, 


100. 


101. 


102. 


103. 


104. 


105. 


106. 


107. 


108. 


109. 


110, 


111. 


Ibid 1970. Many responses per food re- 
ward with free food present. 169:503- 
4 


Nicolaidis, S. 1969. Early systemic re- 
sponses to orogastric stimulation in the 
regulation of food and water balance: 
functional and electrophysiological 
data. Ann. NY Acad. Scr. 157:1176- 
1203 

Nisbett, R. E. 1972. Hunger, obesity, 
and the ventromedial hypothalamus. 
Psychol. Rev. 79:433-53 

Penick, S. B., Filion, R., Fox, S., Stunk- 
ard, A. J. 1971. Behavior modification 
in the treatment of obesity. Psychosom. 
Med, 33:49--55 

Perkins, C. C. Jr. 1968. An analysis of 
the concept of reinforcement. Psychol. 
Rev. 75:155~-72 

Razran, G. 1971. Mind in Evolution: 
An East-West Synthesis of Learned Be- 
havior and Cognition. Boston: Hough- 
ton Mifflin. 430 pp. 

Rescorla, R. A. 1972. Informational 
variables in Pavlovian conditioning. In 
The Psychology of Learning and Moti- 
vation: Advances in Research and The- 
ory, ed. G. H. Bower, 6:i-46. New 
York: Academic. 312 pp. 

Rescorla, R. A., Wagner, A. R. 1972. A 
theory of Pavlovian conditioning: varia- 
tions in the effectiveness of reinforce- 
ment and nonreinforcement. See Ref. 
13, 64-99 

Revusky, S., Garcia, J. 1970. Learned 
associations over long delays. In The 
Psychology of Learning and Motiva- 
tion: Advances in Research and Theory, 
ed. G. H. Bower, 4:1-84. New York: 
Academic. 339 pp. 

Rozin, P., Kalat, J. W. 1971. Specific 
hungers and poison avoidance as adap- 
tive specializations of learning. Psychol. 
Rev. 78:459-86 

Saltz, E. 1971. The Cognitive Bases of 
Human Learning. Homewood, Hi.: 
Dorsey. 510 pp. 

Salzinger, K. 1973. Inside the black 
box, with apologies to Pandora. A re- 
view of Ulric Neisser’s Cognitive Psy- 
chology. J. Exp. Anal. Behav. 
19:369--78 

Schachter, S 1971. Emotion, Obesity, 
and Crime. New York: Academic. 195 


pp. 

Schaub, R. E. 1969. Response-cue con- 
tingency and cue effectiveness. See Ref. 
68, 342-56 

Schwartz, B. 1972. The role of positive 
conditioned reinforcement in the main- 
tenance of keypecking which prevents 


112, 


113. 


114. 


115. 


116. 


117. 


118. 


119. 


120. 


121. 


122. 


123. 


124. 


125. 


126. 


delivery of primary reinforcement. 
Psychon. Sei. 28:277-78 

Scrignar, C. B. 1971. Food as the rein- 
forcer in the outpatient treatment of 
anorexia nervosa. J. Behav. Ther. Exp. 
Psychiat. 2:31-36 

Segal, E F. 1972. Induction and the 
provenance of operants. See Ref. 62, 
1-34 

Seligman, M. E. P. 1970. On the gener- 
ality of the laws of learning. Psychol. 
Rev. 77:406-18 

Seligman, M. E. P., Bravman, S., Rad- 
ford, R. 1970. Drinking: discriminative 
conditioning in the rat. Psychon. Sci. 
20:63--64 

Seligman, M. E. P., Hager, J. L., Eds. 
1972. Biological Boundaries of Learn- 
ing. New York: Appleton-Century- 
Crofts. 480 pp. 

Seligman, M. E. P., Ives, C. E., Ames, 
H., Mineka, S. 1970. Conditioned 
drinking and its failure to extinguish: 
avoidance, preparedness, or functional 
autonomy? J. Comp. Physiol. Psychol. 
71:411-19 

Seligman, M. E. P., Johnston, J. C. 
1973. A cognitive theory of avoidance 
learning. See Ref. 92, 69-110 
Seligman, M. E. P., Mineka, S., Fillit, 
H 1971. Conditioned drinking pro- 
duced by procaine, NaCl, and angioten- 
sin. J. Comp. Physiol. Psychol. 
77:110-21 

Shettleworth, S. J. 1972. Constraints on 
learning. In Advances in the Study of 
Behavior, ed. D. S. Lehrman, R. A. 
Hinde, E. Shaw, 4:1-68. New York: 
Academic. 217 pp. 

Shettleworth, S. J. 1972. Stimulus rele- 
vance in the control of drinking and 
conditioned fear responses in domestic 
chicks (Gallus gallus). Comp. 
Physiol. Psychol. 80:175-98 

Siegel, S. 1972. Conditioning of insulin- 
induced glycemia. Comp. Physiol. 
Psychol. 78:233-41 

Siegel, S., Nettleton, N. 1970. Condi- 
tioning of insulin-induced hyperphagia. 
J. Comp. Physiol. Psychol. 72:390-93 
Silverstein, A. 1972. Secondary rein- 
forcement in infants. J. Exp. Child Psy- 
chol. 13:138-44 

Singh, D. 1970. Preference for bar 
pressing to obtain reward over freeload- 
ing in rats and children. J. Comp. 
Physiol. Psychol. 73:320-27 

Ibid 1972. Preference for mode of ob- 
taining reinforcement in rats with le- 
sions in septal or ventromedial 
hypothalamic area. 80:259-68 


127. 


128. 


129: 


130. 


131; 


132. 


133. 


134. 


135. 


136. 


137, 


138. 
139. 


140. 


141. 


142. 


DERIVED MOTIVES 105 


Singh, D., Query, W. T. 1971. Prefer- 
ence for work over “freeloading” in 
children. Psychon. Sci. 24:77-19 
Smith, T. D. 1972. Development of to- 
kens as secondary reinforcers. J. Exp. 
Child Psychol. 14:133-38 

Solomon, R. L., Corbit, J. D. 1973. An 
opponent-process theory of motivation: 
II. Cigarette addiction. J. Abnorm. Psy- 
chol. 81:158~71 

Solomon, R. L., Wynne, L. C. 1954. 
Traumatic avoidance learning: the prin- 
ciples of anxiety conservation and par- 
tial irreversibility. Psychol. Rev. 
61:353-85 

Spatz, C., Jones, S. D. 1971. Starvation 
anorexia as an explanation of “self-star- 
vation” of rats living in activity wheels. 
J. Comp. Physiol. Psychol. T1:313-17 
Staddon, J. E. R., Simmelhag, V. L. 
1971. The “superstition” experiment: a 
reexamination of its implications for the 
principles of adaptive behavior. Psy- 
chol. Rev. 78:3-43 

Stellar, E. 1967. Hunger in man; com- 
parative and physiological studies. Am. 
Psychol. 22:105-17 

Stolz, S. B., Lott, D. F. 1964. Establish- 
ment in rats of a persistent response 
producing a net loss of reinforcement. 
J. Comp. Physiol. Psychol. 57:147-49 
Taylor, G. T. 1972. A limitation of the 
contrafreeloading phenomenon. Psy- 
chon. Sct, 29:173-74 

Waddell, T. R., Leander, J. D., Webbe, 
F. M., Malagodi, E. F. 1972, Schedule 
interactions in second-order fixed-inter- 
val (fixed-ratio) schedules of token rein- 
forcement. Learn. Motiv, 3:91-100 
Walike, B. C., Jordan, H. A., Stellar, E. 
1969. Preloading and the regulation of 
food intake in man. J. Comp. Physiol. 
Psychol, 68:327-33 

Wayner, M. J., Carey, R. J. 1973. Basic 
drives. Ann. Rev. Psychol. 24:53-80 
Wayner, M. J., Cott, A. 1971. Condi- 
tioned drinking and the effects of sac- 
charin on its recovery after lateral 
hypothalamic lesions. Physiol. Behav. 
7:201-6 

Wayner, M. J. Jr., Emmers, R. 1959. A 
test of the thirst-deprivation hypothesis 
in the hooded rat. J Comp. Physiol. 
Psychol. 52:112-15 

Wayner, M. J., Gawronski, D., Mueller, 
R. 1967, Effects of procaine-HCi on the 
salt arousal of drinking. Physiol. Behav. 
2:249-54 

Wike, E. L. 1970. Secondary reinforce- 
ment: some research and theoretical is- 
sues. In Nebraska Symposium on 


106 


143. 


144. 


145. 


146. 


147. 


148. 


149. 


D’AMATO 


Motivation, 1969, ed. W. J. Arnold, D. 
Levine, 39-84. Lincoln: Univ. Nebraska 
Press. 334 pp. 

Wilcoxon, H. C., Dragoin, W. B., Kral, 
P. A. 1971. Illness-induced aversions in 
rat and quail: relative salience of visual 
and gustatory cues. Science 171:826-28 
Williams, D. R., Williams H 1969. 
Auto-maintenance in the pigeon: sus- 
tained pecking despite contingent non- 
reinforcement. J. Exp. Anal. Behav. 
12:511-20 

Wilton, R. N. 1972. The role of infor- 
mation in the emission of observing re- 
sponses and partial reinforcement 
acquisition phenomena. Learn. Motiv. 
3:479-99 

Wilton, R. N., Clements, R. O. 1971. 
Observing responses and informative 
Su: J. Exp. Anal. Behay. 15:199- 
204 

Ibid. The role of information in the 
emission of observing responses: a test 
of two hypotheses. 16:161-66 

Woods, S., Makous, W., Hutton, A. 
1968. A new technique for conditioned 
hypoglycemia. Psychon. Sci. 10:389-90 
Woods, S. C., Makous, W., Hutton, R. 
A. 1969. Temporal parameters of condi- 
tioned hypoglycemia. J. Comp. Physiol. 
Psychol, 69:301-7 


150. 


131. 


152. 


153. 


154. 


155. 


156. 


Woods, S. C., Shogren, R. E. Jr. 1972. 
Glycemic responses following condi- 
tioning with different doses of insulin in 
rats. J. Comp. Physiol. Psychol, 81: 
220-25 

Wooley, O. W., Wooley, S. C., Dun- 
ham, R. B. 1972. Can calories be per- 
ceived and do they affect hunger in 
obese and nonobese humans? J. Comp. 
Physiol, Psychol. 80:250-58 

Wooley, S. C. 1972. Physiologic versus 
cognitive factors in short term food reg- 
ulation in the obese and nonobese. Psy- 
chosom. Med. 34:62-68 

Wyckoff, L. B. Jr. 1952. The role of 
observing responses in discrimination 
learning. Part 1. Psychol. Rev. 59: 
431-42 

Ibid 1959. Toward a quantitative theory 
ar secondary reinforcement. 66:68- 


Zahorik, D. M. 1972. Conditioned 
physiological changes associated with 
learned aversions to tastes paired with 
thiamine deficiency in the rat. J. Comp. 
Physiol. Psychol, 79:189-200 
Zahorik, D. M., Maier, S. F. 1969. Ap- 
petitive conditioning with recovery 
from thiamine deficiency as the uncon- 
ditioned stimulus. Psychon. Sci. 17: 
309-10 


Copyright 1974. All rights reserved 


ANIMAL LEARNING—VISCERAL #212 
AND AUTONOMIC CONDITIONING 


Alan H Harris and Joseph H Brady’ 
The Johns Hopkins University School of Medicine, Baltimore, Maryland 


This chapter will be concerned with the effects of behavioral conditioning proce- 
dures, both classical and instrumental, upon physiological response measures com- 
monly referred to as “visceral” and autonomic (e.g. heart rate, blood pressure, 
gastric motility). Within this general framework, the laboratory animal research to 
be reviewed can be divided into three broad investigative categories based upon the 
relationship between the physiological measures and the conditioning procedures. 
In the first Classical Autonomic Conditioning category, the physiological event that 
is conditioned appears initially as an unconditioned response (e.g. salivation) to an 
unconditioned stimulus (e.g. food) and is subsequently observed to occur (though 
not necessarily in identical form) during presentation of a conditioned stimulus (e.g. 
bell) which has been paired repeatedly with the unconditioned stimulus. The second 
Concurrent Autonomic Conditioning category is defined by experimental ap- 
proaches that depend primarily upon operant conditioning procedures to establish 
and maintain ongoing performances (e.g. lever pressing) while concurrently measur- 
ing physiological changes (e.g. hormone elevations) which show systematic relation- 
ships to the conditioned instrumental behavior. And in the third Instrumental 
Autonomic Conditioning category, an essential feature of the procedure emphasizes 
a contingency relationship between antecedent physiological changes (e.g. heart rate 
increase) and experimentally programmed environmental consequences (e. g. food 
delivery and/or shock avoidance). 

Over the past decade, animal learning studies within the framework of these three 
autonomic conditioning paradigms clearly have focused more upon some physiolog- 
ical response systems (e. g. cardiovascular) than others (e.g. gastrointestinal), and 
this differential emphasis will of necessity be reflected in the ensuing categorical 
review. To a considerable extent, this uneven distribution of physiological measures 
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is a function of the technological and methodological developments which have 
paced the emergence of a scientifically operational laboratory psychophysiology. In 
this regard it is appropriate to acknowledge at least some of the major innovations 
and refinements upon which this burgeoning research domain depends. Particularly 
noteworthy have been the technical advances in the recording and measurement of 
heart rate (30, 48, 63, 87, 98, 147, 193, 202, 207), blood pressure (74, 107, 120, 146, 
149, 201, 245, 264), hormone levels (166), and electromyographic activity (77, 118) 
in both restrained and free-moving animals. As these and other psychophysiological 
developments (50) have increased the ease and accessibility of visceral and auto- 
nomic measurement techniques, an ever-broadening range of biological events have 
been exposed to experimental scrutiny in relationship to behavioral conditioning 
procedures. 


CLASSICAL AUTONOMIC CONDITIONING 


The historical roots of an extensive research literature on classical autonomic condi- 
tioning can be traced at least as far back as the early studies of Pavlov and Sherring- 
ton before the turn of the century. Numerous volumes have been devoted to a broad 
range of psychophysiological experiments which document the central role of such 
conditioning processes in the adaptations and adjustments of the internal environ- 
ment (1, 17, 81, 140, 195, 196, 204, 208). Despite this illustrious heritage, the past 
decade has produced a host of research reports which appear to justify few general- 
izations about the classical conditioning of autonomic responses and contribute 
more to the complexity of the process than to its clarification. This confusing state 
of affairs is nowhere more evident than in the veritable tidal wave of classical cardiac 
conditioning studies which have all but inundated the literature in this once-pristine 
area of experimental inquiry. 

Shearn’s 1961 answer to his own rhetorical question “Can the heart learn?” 
provided convincing evidence of the wide-ranging variability in the form (i.e. accel- 
eration or deceleration) of the classically conditioned heart rate response (226). A 
most charitable characterization of the research on this problem over the past 10 
years would represent the reported studies as an attempt to examine the conditions 
under which such form variations are seen to occur. In this regard, both aversive 
(e.g. electric shock UCS) and appetitive (e.g. food UCS) cardiac conditioning experi- 
ments have confirmed that all possible outcomes can be observed under some 
conditions. Conditioned cardiac responses to a stimulus paired with shock continue 
to be reported as an acceleration in heart rate by some authors (29, 51, 86, 136, 182, 
254) and as a deceleration in heart rate by others (64, 96, 125, 194, 215, 248). Similar 
variability in the form of the conditioned cardiac response have been reported (222) 
with a procedure involving tone-milk pairings which produce both accelerative and 
decelerative changes in heart rate within experimental sessions, though the occur- 
rence of the conditioned response (regardless of its form) was found to be a reliable 
predictor of subsequent consummatory behavior. There is, of course, some basis in 
these conflicting reports for considering the role of species differences (i.e. between 
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rodents, carnivores, and primates), but both increases and decreases have been 
reported within the same species by investigators using essentially the same proce- 
dures (21, 31, 168, 176, 182, 254) and even with the same animal (29, 160, 222). 

In this latter regard, several authors (82, 205, 206, 222, 234, 236, 266) using 
several different species (i.e. laboratory rats, dogs, and monkeys) and CS durations 
(i.e. 12-60 seconds) have all reported a characteristically biphasic cardiac response 
pattern which emerges after repeated CS-UCS pairings. The form in which this 
pattern has been observed is that of an early heart rate acceleration (during the 
initial seconds of the CS) followed by a deceleration in heart rate during the terminal 
seconds of the CS interval before UCS presentation. On a different time scale, the 
experiments reported by Black & Black (29) using a tone-shock pairing procedure 
with rats confirmed this biphasic variation in the form of the conditioned cardiac 
response over successive trials, the heart rate change appearing in decelerative form 
during early trials and developing into an accelerative CR during later trials. To 
some extent at least, these results suggest the possibility that separable response 
components (e.g. “orienting”) may participate differentially in the temporal course 
of cardiac conditioning (135), or that extraneous procedural factors (e.g. “effect of 
person”) involving ongoing subject-experimenter interactions (161, 162) may influ- 
ence the emergent form of the conditioned cardiac response (13). 

Although the results of some studies (169, 269) have indicated that the parametric 
dimensions of the UCS (e.g. shock intensity) may account, at least in part, for some 
of the conditioned cardiac response variability, others (56, 153) suggest that the 
direction of the conditioned cardiac rate change is independent of UCS (electric 
shock) intensity. The most constructive methodological suggestion to emerge from 
recent experimental reports may be the attempt by Lynch (160) to confront the 
variability in heart rate as a lawful datum in providing a functional description of 
the cardiac classical conditioning process. With dogs given tone-shock pairings, 
heart rate variability was found to be consistently decreased during tone presenta- 
tions as compared to non-tone periods, the day-to-day variability measures proving 
to be a more reliable index of cardiac conditioning than the heart rate per se, which 
showed no systematic change over the succession of conditioning trials. 

Several cardiac conditioning studies over the past decade have explored the 
application of the Law of Initial Values (LIV) as a factor in determining the effects 
of classical autonomic conditioning procedures. In at least three reports (26, 206, 
236) some support was provided for a dependence of CR magnitude upon pre-CS 
heart rate level, an inverse relationship having been observed between the classically 
conditioned heart rate change and the cardiac rate during the time interval immedi- 
ately preceding the CS presentation. Controlling the pre-CS heart rate with a cardiac 
pacemaker, however, did not reveal systematic relationships between the classically 
conditiond heart rate response and the paced pre-CS heart rate level (236), suggest- 
ing that the interactions which define LIV effects reflect more the participation of 
central than peripheral factors. A more recent study by Downer & Thompson (78), 
on the other hand, with four different primate species (i.e. baboons, rhesus monkeys, 
spider monkeys, and squirrel monkeys) failed to confirm the LIV effects which have 
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been so well documented in cardiovascular experiments with humans (126, 150), 
and reiterated in yet another dimension the conflicts and confusions which charac- 
terize this classical autonomic conditioning field. 

Trace and Delay procedures of long-standing interest have continued to receive 
experimental attention in classical cardiac conditioning studies concerned with the 
relative power of these alternative paradigms. Several reports based upon both 
appetitive (e.g. light-food pairings) conditioning procedures (222) and aversive (e.g. 
tone-shock pairings) conditioning procedures (95, 248) have confirmed the superi- 
ority of the delay procedure over the trace procedure with regard to the magnitude 
and reliability of the classically conditioned heart rate response. The differences 
between the two procedures do appear to be attenuated, however, in aversive classi- 
cal conditioning experiments as the intensity of the UCS (i.e. shock) is increased (95, 
248). | 

Procedural parameters related to trial frequency and reinforcement schedule (i.e. 
UCS intermittency) have also continued to receive experimental attention as deter- 
minants of classically conditioned cardiac response magnitude and durability. Sin- 
gle-trial conditioning has been reported (182) in dogs with one tone-shock pairing 
producing heart rate changes in response to tone presentations which persisted over 
extended time intervals and during repeated extinction trials. Partial reinforcement 
effects upon resistance to extinction (19, 183) have generally been confirmed in more 
recent studies (93, 94, 96, 97, 123, 253), although in at least one instance (252) no 
difference was found in this regard between partial and continuous reinforcement 
conditions. Similarly, classically conditioned cardiac accelerative responses in 
rhesus monkeys have been reported to show no difference between partial and 
continuous reinforcement procedures, though discrimination of a negative (non- 
shocked) stimulus (CS-) appears to be more readily established when the CS+ is 
continuously reinforced (176). 

Such generalization and discrimination effects have been investigated in several 
other classical cardiac conditioning studies over the past decade with the usual range 
of form and magnitude divergencies in evidence. To some extent at least, procedural 
variables appear to account for recurrent inconsistencies in the findings from differ- 
ent studies, and the relationship between CS+ (i.e. CS presentations followed by the 
UCS) and CS- (i.e. CS presentations not followed by the UCS) trials emerges as an 
important determinant of the reported outcomes with the conditioned cardiac re- 
sponse to the CS+ significantly greater than to the CS- in all cases (21, 64, 83, 136, 
176, 191, 194, 234, 246). An additional study (266), however, appeared to identify 
discrimination difficulty as a parameter influencing the magnitude of the condi- 
tioned heart rate response regardless of the training procedure used, while another 
(83) reported that blood pressure discriminations were less stable than heart rate 
discriminations to the same CS+ and CS-. 

Several recent reports have also described both correlated and uncorrelated 
changes in other circulatory measures during classical conditioning based, for the 
most part, upon traditional tone-shock pairings. Blood pressure (83, 88, 156, 181, 
198, 268), blood flow (198), catecholamine levels (88), and peripheral vasomotor 
changes (110, 111) have all been shown to be affected by such classical conditioning 
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procedures, though agreement is somewhat less than unanimous with regard to 
differentiation of the “orienting” or ‘‘startle’ components from the “true” condi- 
tioned autonomic response pattern (135). The results of one of these studies (181), 
however, involving determinations made with dogs under both curarized and non- 
curarized conditions, suggest at least some degree of independence between such 
classically conditioned cardiovascular responses on the one hand and respiratory 
and skeletal muscle changes on the other. 

Classically conditioned cardiac response mechanisms have been recurrently ana- 
lyzed with the aid of neurophysiological and neuropharmacological research tech- 
niques for selectively evaluating the role of central and peripheral factors in the 
establishment and maintenance of such heart rate changes. Several studies (228, 249, 
254) have shown, for example, that direct stimulation of the brain can serve effec- 
tively as a UCS for cardiac conditioning, though direct peripheral stimulation of the 
ventricular myocardium has been reported (200) ineffective as a UCS for the classi- 
cal conditioning of extrasystoles. “Centrally mediated” extrasystoles, however, ap- 
parently can be conditioned (197, 199). Additional evidence for the central role of 
selective brain mechanisms in the mediation of cardiovascular conditioning effects 
has also been provided by ablation studies (79, 80) implicating both the ven- 
tromedial and dorsolateral tegmentum in control of the classically conditioned heart 
rate response. Neuropharmacological analysis of classical cardiac conditioning has 
also confirmed the role of central neural influences in the control of such heart rate 
changes, Hein (119) having found that the conditioned cardiac response was blocked 
by atropine but not by succinylcholine-induced muscle paralysis. Blockade of the 
sympathetic innervation has also been found to produce only a small reduction in 
the conditioned heart rate response even though a striking reduction in cardiac 
contractility change follows such sympathetic blockade (128). In a related series of 
studies, Schneiderman et al (216, 217) have analyzed the effects of alpha-adrenergic 
and cholinergic blockade upon both conditioned and unconditioned cardiovascular 
responses (e.e. blood pressure and heart rate) in the albino rabbit. Although the 
results of these experiments showed clearly that blood pressure changes contributed 
importantly to both conditioned and unconditioned heart rate changes, it was also 
demonstrated that conditioned cardiac responses could occur in the absence of 
conditioned blood pressure changes. 

The documented role of such regulatory homeostatic mechanisms in the control 
of classically conditioned cardiovascular responses would seem to require that a 
broader biological perspective be maintained with regard to the range of interacting 
visceral and somatic systems which in concert provide for the adjustments and 
adaptations of the internal environment. Within this context, the relationship be- 
tween heart rate and other visceral and motor components of a “total” classically 
conditioned response has been the primary focus of several researchers (21, 28, 57, 
61, 64, 83, 99, 119, 129, 159, 184-186, 190, 194, 209, 234, 238, 243, 250, 267, 268, 
270). Not unexpectedly, something less than a completely consistent account of 
these complex interactions appears to emerge, with some experimental reports 
emphasizing the independent variations of multiple interacting systems (21, 28, 61, 
64, 83, 129, 190, 194, 234, 238, 250, 267, 270), while others focus upon the interde- 
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pendent correlations between response measures (57, 99, 119, 159, 184-186, 209, 
268). Probably the most parsimonious view of this obviously complex process would 
recognize that all possible combinations and permutations can and do occur under 
some circumstances, and that it remains a continuing research challenge to delineate 
and define the range of independent conditions under which such classical auto- 
nomic learning is established. 


CONCURRENT AUTONOMIC CONDITIONING 


Though lacking the long-standing and prestigious laboratory background enjoyed 
by Classical Autonomic Conditioning, an active and productive research interest in 
the measurement of performance-related concurrent autonomic conditioning effects 
has emerged over the past several decades. Significantly, this investigative emphasis 
can be seen to bear a close relationship to the development of instrumental or 
operant learning analysis techniques which focus more directly upon behavioral 
interactions with the external environment. In an initial review of the area, Brady 
(34) surveyed several groups of laboratory studies which at the time appeared to 
define and delineate such research applications as they related to the experimental 
production of altered physiological states. For the most part, the emphasis upon 
endocrine, cardiovascular, and gastrointestinal changes that was apparent in those 
early studies has continued to characterize investigative progress over the past 
decade. In this regard, significant methodological advances related particularly to 
the analysis of hormone response patterns (166) and the measurement and recording 
of blood pressure (201, 245, 264) have contributed importantly to these develop- 
ments. In addition, increasing emphasis upon enduringly chronic preparations (33, 
90, 102, 121) has provided for more extended observation and experimental manipu- 
lation. Aversive behavioral control procedures of established effectiveness, including 
conditioned suppression (85) and free-operant avoidance (259), have continued to 
receive increasing psychophysiological attention, and a progressively more refined 
analysis of the observed interactions is reflected in major groups of concurrent 
conditioning studies concerned with endocrine and cardiovascular changes on the 
one hand, and gastrointestinal effects on the other. 


Endocrine and Cardiovascular Changes 


Systematic increases in plasma 17-hydroxycorticosteroid (17-OH-CS) levels have 
now been described, for example, in relationship to the acquisition of conditioned 
suppression (i.e. “conditioned emotional response” or CER) with monkeys (167), 
and the course of cardiovascular changes during the development and maintenance 
of such conditioned suppression has also been extensively studied in chronically 
catheterized primates with monitoring of heart rate and both systolic and diastolic 
blood pressure (45). Dramatic reversals in both the direction and magnitude of such 
blood pressure and heart rate changes were observed to occur in response to re- 
peated clicker-shock pairings. Initially the development of conditioned behavioral 
suppression over the first 8 to 10 trials for all 5 rhesus monkeys in the study was 
accompanied by a consistent and systematic decrease in both heart rate and blood 
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pressure in response to the 3-minute clicker presentations. Continued daily pairings 
of clicker and shock superimposed upon the lever-pressing performance, however, 
produced abrupt and sustained reversals in both the direction and magnitude of the 
cardiovascular response. Significantly, these changes were observed to persist as 
large magnitude increases in heart rate and both systolic and diastolic blood pressure 
in response to the behaviorally suppressing clicker presentations for from 50 to 100 
daily conditioning trials. The results were interpreted as reflecting the differential 
temporal development of the early CER instrumental performance suppression 
(with the initial cardiovascular “suppression” related to decreases in motor activity 
during the clicker) followed by the later-appearing “conditioned cardiac respon- 
dent” component as sustained cardiovascular activation during clicker presenta- 
tions. It is also of some interest to contrast the form of these concurrent biphasic 
cardiovascular changes with the biphasic form of the classically conditioned heart 
rate response described in the first section of this review. To some extent at least, 
these several findings provide a basis for explaining the recurrent conflicting reports 
(62, 64, 65, 180, 194, 233, 234, 237, 240, 243, 265) regarding CER effects upon the 
cardiovascular system and point conspicuously to the temporal course over which 
some experimental observations are made as a critical source of contributory vari- 
ance. More importantly, the results of these studies seem to argue for “causal 
independence” of the separable but interacting physiological and behavioral re- 
sponses defining such “emotional” conditioning. 

The extended analysis of behaviorally induced endocrine and cardiovascular 
changes over the past decade has focused even more intensively upon conditioned 
avoidance models, predominantly of the free-operant or Sidman variety (229). In 
addition to multiple reconfirmations (33, 35) of the twofold to fourfold elevations 
in 17-OH-CS levels associated with even relatively brief, shock-free experimental 
exposures to this performance requirement, marked differences in the hormone 
response have been observed (167) when the free-operant avoidance procedure 
includes a discriminable exteroceptive warning signal or when “free” shocks are 
superimposed upon the performance baseline. Significantly, the corticosteroid re- 
sponse was consistently reduced during “discriminated” avoidance sessions includ- 
ing an exteroceptive auditory stimulus presented 5 seconds before shock whenever 
15 seconds had elapsed since a previous response, though removal of the “warning 
signal” resulted in the immediate reappearance of the steroid elevations. Conversely, 
superimposing “free” or unavoidable shocks upon a well-established avoidance 
performance without a “warning signal” has been reported to produce more than 
a 100% increase over the elevated corticosteroid levels observed during the regular 
nondiscriminated Sidman avoidance procedure. 

Significant advances have also been made during the past decade in determining 
the endocrine and cardiovascular consequences of free-operant avoidance perfor- 
mance requirements during long-term studies over months and even years. A report 
by Brady (33), for example, describes the effects of repeated exposure to continuous 
72-hour avoidance over periods up to, and in some cases exceeding, one year upon 
patterns of thyroid, gonadal, and adrenal hormone secretion in a series of five 
chair-restrained rhesus monkeys. Two of the five monkeys participated in the 72- 
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hour avoidance experiment on six separate occasions over a 6-month period with 
an interval of approximately 4 weeks between each exposure. The remaining three 
animals performed on a schedule which repeatedly programmed 72-hour avoidance 
cycles followed by 96-hour nonavoidance or “rest” cycles (3 days “on” and 4 days 
“off’) for periods up to and exceeding one year. 

The two animals exposed to repeated 72-hour avoidance at monthly intervals for 
6 months showed a progressively increasing lever-pressing response rate with each 
of the six successive 72-hour avoidance sessions. In contrast, shock frequencies over 
this same period showed a sharp decline within the first two 72-hour avoidance 
sessions and remained at a stable low level for the remaining four 72-hour avoidance 
cycles. Hormone changes related to the repeated 72-hour avoidance cycles showed 
consistent and replicable patterns over the 6-month experimental period for both 
animals. During the initial experimental sessions both monkeys showed approxi- 
mately threefold elevations in 17-OH-CS levels during 72-hour avoidance and re- 
turned to near baseline levels about 6 days afterwards. The remaining four monthly 
experiments were characterized by substantial though diminished steroid responses 
(approximately twofold elevations in 17-OH-CS levels) during avoidance, with es- 
sentially the same 6-day period required for recovery of basal levels. Significant 
changes related to the extended avoidance performance were also observed in cate- 
cholamine, gonadal, and thyroid hormone levels, with recovery cycles extending in 
some instances (thyroid) for 3 weeks following the 72-hour avoidance period. A 
detailed experimental and interpretive analysis of such multiple hormone changes 
induced by exposure to the 72-hour Sidman procedure has been provided in an 
exhaustive multiauthored monograph (166) describing this most systematic labora- 
tory study series yet to appear in the psychoendocrine literature. 

The three remaining monkeys described in the Brady (33) report as performing 
on the 3 days “on,” 4 days “off” avoidance schedule showed an initial increase in 
lever-pressing response rates for approximately the first 10 avoidance sessions simi- 
lar to that seen with the two animals described above. By approximately the 20th 
weekly session with these animals, however, lever-pressing response rates during the 
72-hour avoidance period had decreased to a value well below that observed during 
the initial avoidance sessions, and the performance tended to stabilize at this new 
low level for the ensuing weeks of the experiment. In contrast, shock frequencies 
for all animals quickly approximated a stable low level within the first two or three 
exposures to the avoidance schedule and seldom exceeded a rate of two shocks per 
hour for the remainder of the experiment. The initial 72-hour avoidance sessions 
were also characterized by significant elevations in 17-OH-CS levels. In the succeed- 
ing weeks, 17-OH-CS levels gradually declined but rose again by the 30th week. The 
general pattern obtained was replicated with all three animals with only minor 
variations, though the change in responsivity of the pituitary-adrenal system to the 
avoidance stress with continued exposure to this procedure over extended time 
periods was perhaps the most consistent and striking observation. These findings are 
somewhat at variance with the repeated observations made in many previous acute 
studies (33, 35, 167) of a close positive relationship between steroid elevations and 
avoidance performance, and indicate that continued exposure to this repeated per- 
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formance requirement on the time schedule programmed in this experiment pro- 
duces an apparent dissociation between the avoidance performance and the 
17-OH-CS response. Although a definitive analysis of such interactions is not possi- 
ble on the basis of these data alone, a critical role of the temporal parameters 
(work-rest cycles) is clearly indicated, and the relationship of these phasic hormone 
changes to the previously described classical and concurrent cardiovascular condi- 
tioning effects would seem to require detailed investigation. 

A trend toward more extended periods of experimental observation and measure- 
ment has been apparent also in concurrent avoidance studies focusing upon cardio- 
vascular changes, particularly in primates. Both rhesus (102) and squirrel monkeys 
(121) have been reported to develop hypertensive blood pressure levels with recur- 
rent exposure to free-operant avoidance requirements for periods up to and exceed- 
ing 12 months. Chair-restrained baboons (91) performing on a discrete-trial fixed 
ratio instrumental escape-avoidance procedure, however, were not found to main- 
tain elevated blood presure levels over the year or more during which they partic- 
ipated in the study (90). Indeed, the baboons in this extended study did show 
substantial pressure increases during the actual escape-avoidance performance inter- 
vals within the daily experimental sessions, and there were some periods during the 
first several months on the program which were characterized by general elevations 
in both blood pressure and heart rate. But the chronically high cardiac output levels 
(i.e. heart rate) maintained apparently by the extended exposure to work-activity 
requirements of the ratio escape-avoidance schedule in this study suggest procedural 
differences which may account, at least in part, for the long-term return to normo- 
tensive pressure levels not recovered in previous studies (102, 121) with exposures 
of comparable duration to less demanding free-operant avoidance requirements. 

A recent series of studies on cardiovascular changes associated with operant 
avoidance procedures (7-12) provides further evidence which is at least consistent 
with the hypothesized relationship between muscle activity and the dynamic inter- 
play between cardiac output and peripheral resistance in the behavioral pathogene- 
sis of hypertensive conditions. The focus of these studies with dogs has been upon 
continuous monitoring of blood pressure and heart rate during free-operant (panel 
press) shock avoidance and, significantly, during a fixed-interval pre-avoidance 
period systematically programmed to precede the required avoidance performance. 
Under these conditions a unique divergence between heart rate and blood pressure 
changes was observed during pre-avoidance intervals up to 15 hours in length, with 
virtually all animals showing a characteristic systolic and diastolic increase and 
either a decrease or no change in heart rate. Comparisons involving similar perfor- 
mance requirements on a variable interval food reinforcement schedule revealed a 
markedly different pre-performance cardiovascular pattern characterized by sys- 
tematic increases in both heart rate and blood pressure. And this differential 
“preparatory” pattern was confirmed both between individual animals maintained 
separately on each of the procedures and “within” the same animal alternately 
performing on the avoidance and food reinforcement schedule. Moreover, direct 
measurements of cardiac output in dogs prepared with aortic flow probes during 
exposure to the avoidance program confirmed that the pre-avoidance pressure 
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changes were attributable to increased peripheral resistance while the pressure 
increases during the avoidance performance per se occurred as the peripheral resis- 
tance was actually observed to decrease and the cardiac output increased markedly. 
Additional studies involving beta-adrenergic blockade with the drug propranolol 
during the same experimental procedure, however, clearly showed that peripheral 
resistance levels do increase to maintain elevated pressure levels during avoidance 
when drug-induced heart rate reductions produce decreases in cardiac output (10). 

The results of these experiments establish firm relationships between a broad 
range of endocrine and cardiovascular response processes and free-operant behav- 
ioral performances. Both general and specific support for these findings have now 
been provided by numerous published reports with rodents, carnivores, and pri- 
mates (15, 20, 27, 35-39, 41-44, 46, 53, 54, 58, 101, 103~106, 109, 113, 114, 122, 
124, 130, 132, 139, 145, 151, 154, 155, 163, 165, 175, 177, 179, 239, 241, 242, 244, 
251, 261). In many respects, the changes in absolute levels of selected hormones and 
autonomic activity can be viewed as reflecting relatively undifferentiated conse- 
quences of arousal states associated with behavioral responses under aversive condi- 
tions. The definite temporal course of visceral and steroid changes under such 
conditions and the quantitative nature of the relationship between degree of behav- 
ioral involvement and level of physiological response have been well documented. 
In addition, the critical role of an organism’s behavioral history in determining the 
nature and extent of autonomic-endocrine response to performance situations has 
been convincingly demonstrated. Clearly, however, the most meaningful dimension 
for hormone and visceral analysis in relationship to more chronic behavioral interac- 
tions would appear to be the broader patterning or balance of secretory and visceral 
changes in many interdependent autonomic and endocrine systems which in concert 
regulate metabolic events. The extensive and prolonged participation of these funda- 
mental systems in behavioral interactions suggests a relationship between such 
physiological activity and the more durable consequences of performances under 
aversive control. Indeed, the differentiation of such autonomic-endocrine response 
patterns in relationship to the historical and situational aspects of behavioral events 
may well provide a first approximate step in the direction of identifying distinguish- 
able intraorganismic consequences associated with both episodic and persistent 
behavioral interactions. 


Gastrointestinal Effects 


While the obvious constraints imposed upon laboratory studies of behaviorally 
induced somatic changes have continued to limit the range of experimental activity 
in this general area over the past decade, the recent research literature does reflect 
an abiding concern with concurrent conditioning effects involving the gastrointesti- 
nal system (2, 235). Of particular interest would seem to be the rather prominent 
controversy over the factors that influence the incidence of peptic ulcers in rodents 
and primates under aversive behavioral control. Some further support for the 
efficacy of “conflict” and related conditioning procedures in the production of 
gastric lesions in laboratory rats has been provided by studies focusing upon a 
variety of social-psychological and physiological parameters (158, 210-214), but 
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replication and confirmation of the reported relationships continue to present prob- 
lems (3, 4, 189). Similarly, recurrent descriptions of avoidance conditioning effects 
upon the gastrointestinal system have characteristically presented something less 
than a consistent picture with regard specifically to the conditions under which 
erosions of the gastric mucosa are most likely to occur. The reported incidence of 
peptic ulcers in rhesus monkeys intermittently exposed to a free-operant shock- 
avoidance procedure (47) has proved difficult to repeat under some laboratory 
conditions (100), including those under which the study originated (32). Addition- 
ally, several investigations with laboratory rats on escape-avoidance procedures 
have failed to find an incidence of gastric lesions in experimental animals which 
exceeded that of controls, and in some instances yoked control animals receiving 
unavoidable shocks alone showed a greater degree of ulceration than their avoiding 
partners (178, 192, 256). 

To some extent a clarification and at least partial reconcilation of these apparently 
conflicting developments in the delineation of behavioral conditioning effects upon 
the gastrointestinal system has been suggested by Weiss in a systematic series of 
published experimental reports from the Rockefeller University (255-258). Starting 
with the observation that laboratory rats receiving intermittent tail shock following 
presentation of a 10-second beeping tone developed significantly less gastric ulcera- 
tion than animals receiving the same shock without the “predictability” provided 
by the pre-aversive “warning” stimulus, Weiss examined the effects of adding an 
Operant escape-avoidance (“coping”) panel-press to the procedure. Under these 
conditions markedly fewer gastric lesions were found in the experimental animals 
when compared with “helpless” controls similarly exposed to warning signals and 
shocks (one per minute for 21 hours) but without escape-avoidance “coping.” When 
the interactions between warning signals and the escape-avoindance responses were 
tested in a subsequent experiment involving groups differentiating between two 
types of pre-aversive stimuli (“beeping” tone and an “added clock”), presence or 
absence of escape-avoidance wheel-turning, and appropriate “yoked” controls, the 
results provided Weiss with a basis for proposing a “theory” to reconcile the 
apparent contradictions between the rat and monkey studies. While the outcome of 
this rather mammoth 180-rat experiment confirmed the prepotence of the operant 
escape-avoidance conditioning in reducing the incidence of ulcers, the addition of 
a warning signal to the procedure was found to attenuate the development of gastric 
pathology even further. It is of some interest to note that a similar warning stimulus 
procedure has been found to reduce the steroid response to operant avoidance 
performance requirements in the monkey (167). 

In Weiss’ view, these findings indicate that the incidence of peptic ulcers may be 
a function of the interaction between strength of the escape-avoidance performance 
(i.e. the frequency of “coping” responses) and the probability of discriminable 
response-contingent signals associated with the absence of aversive stimuli (i.e. 
“feedback” about shock-free conditions). Within the framework of this interpretive 
analysis, the incidence of peptic ulcers in free-operant avoidance monkeys would be 
accounted for in terms of a high response frequency in the absence of warning 
stimuli and the relatively low “feedback” discriminability of “safe” signals produced 
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by those responses. The yoked-control monkeys, in contrast, characteristically emit- 
ted “avoidance” responses only infrequently, received only a few shocks well distrib- 
uted in time (due to the high performance rates of the experimental animals), and 
were found to be free of gastrointestinal pathology. Some further confirmation of 
this formulation has been provided by Weiss in a subsequent series of experimental 
manipulations which increased the frequency of ulcers in avoidance rats punished 
with shock for responding, and decreased the incidence of ulcers in animals produc- 
ing a brief tone with each shock-postponing panel-press (257, 258). 

It is perhaps worth noting (though a bit out of place in the “systems” organization 
of this section) that Weiss and his colleagues at the Rockefeller University have also 
found hormonal and body weight changes which reflect the interactions suggested 
by the ulcer studies, and that levels of brain norepinephrine are increased in escape- 
avoidance animals and decreased in nonperforming (“helpless”) shocked rats (259- 
261). 


INSTRUMENTAL AUTONOMIC CONDITIONING 


Animal studies concerned with the experimental analysis ofinstrumental autonomic 
conditioning represent a relatively recent development in the visceral learning field, 
with laboratory roots originating in the work of Neal Miller and his students at Yale 
in the mid-1960s. Indeed, several earlier reports with human subjects (60, 108, 131, 
141, 143, 157, 208, 224, 227) had foretold of such “operant” learning effects involv- 
ing visceral and autonomic processes, and an extensive literature on “voluntary” 
physiological control by Yoga meditation and breathing techniques (5, 6, 14, 262, 
263) has long been available. But the experimental analysis of such instrumental 
autonomic conditioning effects in the animal laboratory has clearly activated a new 
research era in the investigation and application of such “visceral learning” phenom- 
ena. 

The earliest reported animal learning experiments on instrumental autonomic 
conditioning involved an attempt by Miller & Carmona (172) to change the rate of 
salivation in a water-deprived dog by operantly reinforcing both increases and 
decreases in this antecedent autonomic response with a contingent environmental 
consequence (i.e. water reward). Although the results of this study showed clearly 
that such autonomic responses could be operantly controlled, attention was focused 
upon the possible role of skeletal muscle activity as a “mediator” of the observed 
visceral changes. Since the curarization technique used to control such skeletal 
muscle mediation produced direct effects upon salivation, an experiment by Trowill 
(247) explored the operant control of Aeart rate in curarized laboratory rats using 
rewarding electrical brain stimulation (medial forebrain bundle at the level of the 
posterior hypothalamus) as a contingent reinforcing consequence. Although the 
actual changes were small, both increases and decreases in heart rate were success- 
fully conditioned. A subsequent study by Miller & DiCara (173) showed that the 
magnitude of the instrumentally conditioned heart rate response could be influenced 
dramatically (producing changes approximating 20% of basal values) by a “shap- 
ing” procedure which required the animals to meet a progressively more difficult 
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criterion in order to obtain the rewarding brain stimulation. In addition, this experi- 
ment also demonstrated that such operantly conditioned autonomic changes could 
be brought under discriminative control of the specific stimulus complex which 
provided the occasion for reinforcement of either the heart rate raising or lowering 
response. In a further confirmation of this instrumental heart rate conditioning 
effect, Hothersall & Brener (127), using curarized rats and electrical brain stimula- 
tion reward, incorporated a feedback light whenever the prescribed criterion was 
met, and extended their investigation to include a demonstration of operant extinc- 
tion when the instrumentally conditioned heart rate response was no longer rein- 
forced with brain stimulation. 

Persistent concern with the degree of skeletal involvement in such instrumental 
autonomic conditioning was reflected in a series of experiments by Black (22) with 
dogs initially trained on a lever-pressing shock-avoidance task and subsequently 
curarized for operant conditioning of either e/ectromyographic or heart rate re- 
sponses. The level of curarization insured little or no overt movement but did not 
completely eliminate the EMG response. The results showed that the instrumentally 
conditioned heart rate changes were closely associated with the conditioned EMG 
changes which readily transferred to affect the performance in the noncurarized 
state. In a later report, however, Black (24) concluded that the heart rate response 
could be conditioned independently of overt movement, without conceding that the 
operantly controlled autonomic changes occurred in the absence of some central 
events related to the initiation and performance of skeletal motor responses. DiCara 
& Miller (68) had in fact hypothesized that such central activity (e.g. motor cortex 
impulses) classically conditioned to elicit heart rate changes might account for the 
demonstrated autonomic effects, but on the basis of an experiment with curarized 
rats using a tail-shock control, they argued that such indirect skeletal motor influ- 
ences could not provide a sufficient account of the instrumentally conditioned heart 
rate change. They further showed (68) that the completely curarized rat (i.e. no 
EMG responses from the gastrocnemius muscle) could learn to both increase and 
decrease heart rate as an operant shock-avoidance response, thus establishing that 
the instrumental autonomic conditioning effect was not an artifact of the electrical 
brain stimulation reinforcer. The results of this study also showed that discrimina- 
tive control over the conditioned heart rate change could be developed and main- 
tained by a stimulus that always preceded shock presentation. Such instrumentally 
learned heart rate changes have also been shown to persist in the absence of rein- 
forced practice trials over extended periods (e.g. 3-month retention tests) and to be 
relearned after extinction (71). 

Transfer of learning effects have also been demonstrated in a series of studies by 
DiCara & Miller (72, 73) in which heart rate changes instrumentally conditioned 
under curare were subsequently (one week later) observed in free-moving rats, and 
additional training in the noncurarized state was shown to produce changes of even 
greater magnitude. Similar transfer was also demonstrated from the noncurarized 
to the curarized state, and the differences in respiration and gross movement were 
found to decrease as the differences in heart rate increased. This emergent response 
specificity has also been documented in an experiment by Miller & Banuazizi (171) 
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in which independent operant control of both heart rate changes and intestinal 
contractions was demonstrated, and Fields (89) has subsequently demonstrated the 
remarkable specificity of such conditioning effects by producing instrumentally 
learned increases or decreases in the P-R interval of the EKG independently of 
changes in the P-P interval. That the issues related to a linkage between somatomo- 
tor and cardiovascular activities in the instrumental autonomic conditioning process 
have not been definitively settled, however, is clearly evident in a recent report by 
Goesling & Brener (112) which showed that two different training procedures (i.e. 
immobility training vs activity training) can have a greater effect upon heart rate 
changes in curarized rats than the reinforcement contingencies per se. Prior discrim- 
ination training, however, appears to have no effect upon the magnitude of the 
changes produced by subsequent instrumental heart rate conditioning in curarized 
animals (230). 

Interaction effects involving operant heart rate conditioning and other related 
psychophysiological processes have been investigated in a number of animal learn- 
ing studies. Curarized rats operantly pretrained to decrease heart rate, for example, 
have been shown to subsequently acquire (in the noncurarized state) shuttle-box 
escape-avoidance behavior more readily than rats similarly pretrained to increase 
their heart rate (76). It has also been reported (84) that blood pressure changes were 
significantly positively correlated with heart rate decreases instrumentally condi- 
tioned as an avoidance response in rhesus monkeys, but such blood pressure effects 
were uncorrelated with instrumentally conditioned heart rate increases in the same 
animals. Differences in the opposite direction have been reported with respect to 
epinephrine and norepinephrine by DiCara & Stone (75), who found higher endoge- 
nous cardiac and brainstem catecholamine levels in rats operantly trained to in- 
crease heart rate as compared to rats instrumentally conditioned to decrease heart 
rate. Cardiac H’-norepinephrine retention studies by these same authors, however, 
suggested that rats trained to decrease heart rate under curare were subjected to 
greater stress than rats trained to increase heart rate. Of additional interest in this 
regard is the finding by DiCara et al (67) that an intact neocortex is essential for 
instrumental autonomic conditioning, though this appears not to be the case with 
respect to the classical conditioning of the same heart rate and gastrointestinal 
responses. 

The instrumental conditioning of blood pressure in the curarized rat was convinc- 
ingly demonstrated by DiCara & Miller (69) using a shock avoidance procedure to 
reinforce both increases and decreases in systolic pressure levels independently of 
changes in heart rate and rectal temperature. And in a subsequent study, these same 
investigators (70), using electrical brain stimulation as a reinforcer with the cura- 
rized rat, dramatically confirmed the specificity of such instrumental autonomic 
learning by selectively conditioning vasomotor tone increases in one ear and 
vasomotor tone decreases in the other ear of the same animal. Significantly, these 
conditioned blood Row changes were not correlated with heart rate, rectal tempera- 
ture, or vasomotor tone in the tail, suggesting a remarkable and previously unrecog- 
nized localization of sympathetic action. Following a replication of these findings, 
(69), Pappas et al (188) further demonstrated that instrumentally conditioned sys- 
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tolic blood pressure increases and decreases in noncurarized rats did not transfer 
to the curarized state, but that retraining the same animals after curarization pro- 
duced even larger magnitude pressure changes than in the noncurarized state. 
Similar observations with respect to the specificity of instrumentally conditioned 
cardiovascular changes previously reported by DiCara & Miller (69, 72) received 
additional support from the finding by Pappas et al (188) that the operantly condi- 
tioned blood pressure effects were independent of heart rate and gross skeletal 
activity. 

Large magnitude diastolic blood pressure elevations (50-60 mm Hg) conditioned 
instrumentally as a shock avoidance response in the rhesus monkey were first 
reported by Plumlee (203), though the relatively short duration of the changes and 
the observed postural effects suggested mediation by a Valsalva maneuver (i.e. 
alteration of intrathoracic pressure by abdominal muscle contraction). Somewhat 
more modest elevations in mean arterial pressure (e.g. 25 mm Hg) were maintained 
in squirrel monkeys by Benson et al (18) for periods of 20 minutes or longer as a 
result of an operant reinforcement contingency arrangement which required the 
indicated pressure change as a shock avoidance response. And Harris et al (115) 
have also shown that substantial elevations in both systolic and diastolic blood 
pressure (e.g. 50-60 mm Hei could be established and maintained in dog-faced 
baboons by an operant conditioning procedure which provided for food delivery and 
shock avoidance programmed as environmental consequences contingent upon pre- 
scribed increases in diastolic pressure levels. The instrumentally conditioned blood 
pressure changes were sustained for intervals up to and exceeding 5 minutes and 
appeared to bear somewhat systematic but complex temporal relationships to varia- 
tions in heart rate. More recently, these same authors (116) have extended this basic 
instrumental autonomic conditioning procedure with the baboon to produce more 
sustained and clinically relevant increases (30-40 mm Hg) in both systolic and 
diastolic blood pressure throughout daily 12-hour experimental sessions. Signifi- 
cantly, the maintained instrumentally conditioned blood pressure increases were 
accompanied by elevated but progressively decreasing heart rate levels. 

Instrumentally conditioned glandular response changes have also been demon- 
strated in an experiment with curarized rats rewarded by electrical brain stimulation 
for both increases and decreases in the rate of urine formation by the kidney (174). 
Using inulin-'*C and tritiated p-aminohippuric acid, it was also determined that 
both glomerular filtration rate and renal blood Bow were systematically altered by 
the operant conditioning procedure, though heart rate, blood pressure, and periph- 
eral blood flow were not, confirming the high degree of specificity and localization 
of action emphasized in previous reports (69, 70, 89, 171). More recently, Banuazizi 
(16) has extended the experimental analysis of selectively conditioned intestinal 
contractions in the curarized rat by demonstrating discriminative stimulus control 
of both short- and long-duration intestinal responses operantly reinforced by shock 
avoidance. Significantly, the study also included a control for the unconditioned 
effects of the electric shock as a possible “state dependent” influence upon the 
contractile response, and confirmed the instrumental intestinal conditioning under 
even these more stringent requirements. 
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The role of feedback stimulation in the establishment and maintenance of instru- 
mentally conditioned visceral and autonomic response changes has been emphasized 
in several recent studies (115, 116, 127, 144, 152) despite the somewhat equivocal 
status of the “interoceptive discrimination” issue as reflected in the literature of the 
past decade (133, 134, 164, 231, 232). The early and more recent studies of Miller 
and his students as reviewed above involved reinforcing stimulus changes (e.g. 
electric brain stimulation) which provided immediate feedback following visceral or 
autonomic response variations. As instrumental autonomic conditioning effects of 
more extended duration have been investigated, however, exteroceptive stimuli (e.g. 
lights, tones) linked with the internal environment by advanced electrophysiological 
recording and amplification techniques (52, 55, 117, 148) have provided both digital 
and analogue representations of critical interoceptive events and processes. In addi- 
tion, such feedback stimuli serve as conditioned reinforcers bridging the temporal 
gap between the visceral response and its maintaining environmental consequences. 
The effectiveness of such stimulus feedback applications has been convincingly 
demonstrated in the instrumental heart rate conditioning studies of Hothersall & 
Brener (127) with curarized rats, and the operant blood pressure conditioning 
experiments of Harris et al (115, 116) with laboratory baboons. 

Despite this emergent operational orientation, interpretive and theoretical ac- 
counts of instrumental autonomic conditioning procedures and results continue to 
focus upon “mediational” issues as these have been reviewed and discussed at length 
in several recent reports (59, 137, 138, 142, 219, 220). As fruitless as the resulting 
regress to “pure” operant status may now appear, the controversies surrounding 
experimental attempts to control such “voluntary mediators” in instrumental auto- 
nomic conditioning studies have at least emphasized the need to reexamine some 
basic formulations regarding conventional distinctions between “the two types” of 
learning (218, 221). With respect to more focused concern involving the interrela- 
tionship between autonomic-visceral and somatomotor activity, two more or less 
distinguishable points of view can be identified. On the one hand, Miller, DiCara, 
and associates (66, 68-70, 170, 171, 188, 247) have appeared to take the position that 
evidence from their own experiments and those of others (223, 225) supports the 
independence of somatomotor and autonomic-visceral control. Black (22-24), Bren- 
ner & Goesling (49), and Obrist et al (187), on the other hand, prefer to represent 
autonomic-visceral and somatomotor activities as two components of a more gen- 
eral, centrally controlled response process. That the dividing line between the two 
formulations may not be too firmly drawn, however, would seem to be indicated by 
the fact that virtually all the adherents to the latter school of thought (25, 112, 187) 
appear willing to concede that the postulated “normal” linkage between the two 
systems may be modified in a variety of ways. Indeed, a more moderate “separable 
but interacting” formulation (40) of the observed psychophysiological relationships 
may better serve the purposes of both clinical and experimental investigators con- 
cerned with the conditions under which dissociation or decoupling of the two 
systems can and do occur in the course of ongoing behavioral transactions between 
organism and environment. 
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A cautionary note must be inserted before concluding the review of instrumental 
autonomic conditioning. As Neal Miller pointed out in his 1972 presidential address 
to the Society for Neuroscience, and has reported in print (170a), there has been a 
progressive decline in the magnitude of learned changes in heart rate from the first 
experiment conducted in 1966 (173) through those of 1972. A number of factors are 
being investigated in order to try to account for this “extraordinarily perplexing and 
vexing phenomenon,” as Miller terms it, but no proposed explanation for the decline 
has yet been confirmed. 

Be all this as it may, the fact remains that the field of animal learning appears 
now to be extending the horizons of conditioning to probe the depths of “inner 
space” (92) with the same incisive methodological approach that has proved so 
productive in analyzing more conventional behavioral interactions with the external 
environment. The potential of this investigative endeavor to provide new insights 
into the behavioral control of physiological processes holds the promise of an active 
participant role for man in learning to deal effectively with his proverbial “worst 
enemy.” 
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EXPERIMENTAL #213 
PSYCHOLINGUISTICS 


P. N. Johnson-Laird 
University of Sussex, Brighton, England 


The fundamental problem in psycholinguistics is simple to formulate: what hap- 
pens when we understand sentences? Most of the other problems would be half-way 
solved if only we had the answer to this question; and potential answers to it, of 
course, have accumulated rapidly ever since George Miller (125) introduced psy- 
chologists to the ideas of transformational grammar. The results of the initial phases 
of experimentation are well known (see, for example, 73, 83, 160); and since Fillen- 
baum (52) has recently contributed a wide-ranging survey of the area, including 
language acquisition and its biological bases, it is appropriate in the present review 
to concentrate on comprehension and its cognate problems. 

Because meaning depends on words and syntax, there is a general consensus that 
comprehension involves processing both of them and combining the meanings of 
words according to the perceived syntactic relations between them. But comprehen- 
sion is only a start: once an individual understands a sentence he may react to it. 
He may try to verify a statement, or to answer a question, or to carry out a 
command. He may simply remember the sentence or one of its implications. Many 
of his tangible responses are of immediate intrinsic interest to a psycholinguist. But 
the responses do have a further use. If an individual can decide that a sentence is 
true, then he is likely to have come to an understanding of it, and his performance 
may therefore be very revealing about this process. Such indirect approaches to the 
study of comprehension are extremely useful because there is no simple veridical 
index of when a sentence has been understood. 

The literature continues to grow faster than knowledge, and in trying to epitomize 
its findings and foibles, we shall follow the progress of a sentence as its syntax is 
parsed, its word meanings retrieved from lexical memory, and its sense ultimately 
understood. We shall examine such matters as how sentences are verified, and how 
questions are answered. Finally, we shall turn to the ways in which sentences and 
larger units of discourse are remembered. 
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THE PERCEPTION OF SYNTAX 


It was implicit in the initial phase of experimental psycholinguistics that the process 
of perception paralleled the levels of analysis in transformational grammar. Yet it 
was clear that there would have to be considerable interplay between these levels 
because purely semantic factors were known to affect the intelligibility of speech. 
Nevertheless, the pioneers believed that the linguistic distinction between surface 
and deep structure was likely to be reflected in the organization of the perceptual 
system, with one set of procedures designed to parse a sentence into its immediate 
constituents—nounphrases, verbphrases, etc.—and another set of procedures re- 
trieving the underlying structure of a sentence, including such relations as its subject 
and object. The distinction was there from the start and it has proved vital to most 
of the work in the area, particularly the sustained investigations into perception 
carried out by Bever, Fodor, Garrett, and their colleagues at MIT. 

The MIT group has made considerable use of a technique in which listeners are 
required to locate a click presented to one ear with respect to a sentence presented 
to the other ear. The subjects usually have to write down the sentence and then to 
indicate where the click occurred within it; however, comparable results appear to 
be obtained when they have merely to Judge whether a click occurred in the same 
place in two sentences (cf. the unpublished MIT study cited in 84). The first studies 
with the technique, reviewed by Bever (14), indicated that the click was often 
perceptually relocated so as to minimize the number of surface constituents that it 
interrupted. It was as though extraneous noises were not permitted to violate the 
integrity of major constituents and were consequently preposed, or postposed, into 
the boundaries between them. The results of more recent studies, however, suggest 
that whatever it is that determines click location, it is mot surface structure. 


The Reality of Deep Structure 


Reber & Anderson (148) certainly found a regression of clicks towards the major 
surface boundary within sentences. But they had purposely confounded this bound- 
ary with the midpoint of the sentences, and a similar regression occurred with both 
random strings of words and strings made up of bursts of white noise. Indeed, a 
distinct response bias towards this position was evident in the performance of a 
control group instructed to respond to “subliminal” but in fact nonexistent clicks. 
There is evidently a similar bias towards locating clicks within words rather than 
between them (109) which also tells against the hypothesis since words are surface 
constituents. Such findings naturally suggest that nonlinguistic attentional factors 
may be primarily responsible for the perceptual location of clicks. But although the 
MIT group subsequently failed to detect effects of surface structure, they have 
regularly reported another phenomenon that seems inexplicable without resort to 
linguistic principles. Clicks are consistently relocated so that they fall into or to- 
wards a boundary between the underlying clauses of sentences—even, Bever (14) 
claims, when such boundaries are unmarked in surface structure. A particularly 
striking finding involved a contrast between two superficially similar sorts of com- 
plement (18). With a nounphrase complement, an entire underlying clause is treated 
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like a nounphrase, e.g. “The corrupt police can’t bear criminals to confess very 
quickly.” A click actually located in the word “‘criminals” would often be reported 
to have occurred between “bear” and “criminals,” i.e. at the point corresponding 
to the boundary between the two underlying clauses. With a verbphrase comple- 
ment, on the other hand, there is no longer a distinct boundary between clauses, e.g. 
“The corrupt police can’t force criminals to confess very quickly.” Here “criminals” 
serves as the object of “force” as well as the subject of “confess”; and a click located 
in the word “criminals” no longer showed any systematic tendency to be preposed. 
It is debatable whether or not there are concomitant differences in the surface 
structure of these two sorts of sentence. Chapin and his colleagues (30) were un- 
happy about this aspect of the materials and undertook a more direct test of Bever’s 
conjecture. They found that clicks placed midway between a major surface bound- 
ary and a subsequent underlying clause boundary were attracted to the surface 
boundary; but when the clause boundary preceded the major surface boundary, 
locational errors were more or less equally divided between them. It seems likely 
that the technique will have to be refined in order to resolve these inconsistencies. 
One feasible modification would be to utilize the paradigms of signal detection, 
setting listeners to detect the presence of a faint click in a specified region of a 
sentence or forcing them to choose between a number of possible alternative loca- 
tions. These procedures may show that there are genuine fluctuations in perceptual 
sensitivity during the syntactic analysis of sentences. Such a view is plausible because 
sensitivity to the location of a switch in speech from one ear to the other is greatest 
at boundaries between clauses (183), and similarly reaction times to clicks are faster 
when they occur in clause boundaries (84). Meanwhile, however, what are we to 
infer from the rather muddled history of clickology? 

The story told by Bever (16) has considerable attraction. He argues that surface 
structure has a less direct perceptual reality than underlying structure. Surface 
structure is a derived construct, and it takes time to mature in the mind of the 
perceiver. Its effects on perceptual tasks such as the location of clicks occur only 
when there is sufficient delay for its mental derivation to take place. Ordinarily, 
when an individual listens to a sentence, he is gathering information about its 
underlying structure, and it is at the end of each clause that such structure is 
assigned—-hence the relocation of clicks to clause boundaries. Bever goes on to argue 
that clauses are more directly perceived than individual words, since listeners recog- 
nize a target word in a list of sentences more quickly when they are told the sentence 
in which it will occur, even though it is the first word of the sentence (15). Moreover, 
Savin & Bever (154) argue that words are more directly perceived than phonemes. 
They found that listeners recognize a target phoneme in a list of syllables more 
quickly when they are told the syllable in which it will occur, even though it is the 
first phoneme of the syllable. This topsy-turvy conception, from which it seems to 
follow that deep structures are more directly apprehended than phonemes, makes 
it difficult to explain such everyday phenomena as the detection of mispronuncia- 
tions, neologisms, and solecisms. Yet is is a useful supplement to the more conven- 
tional order of processing--from sound through syntax to sense. It embodies the 
important truth that the perceiver does not come empty-handed to his task. He is 
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likely to have a number of conjectures about the meaning of a sentence, derived from 
his immediate or general semantic knowledge. He is likely to possess a variety of 
perceptual heuristics to aid him in the assignment of underlying structure, e.g. the 
occurrence of a noun-verb-noun sequence normally signifies the underlying roles of 
actor-action-object (14). 

If a conventional order of processing is permitted to augment Bever’s scheme, 
then it is feasible that a perceiver also exploits cues in the surface structure of a 
sentence in order to recover its underlying structure (55). Such cues appear to divide 
into two categories: the presence of grammatical formatives indicating explicit 
Structure, and the presence of certain lexical items suggesting definite hypotheses 
about structure. Particular exemplars of both sorts of cue have been investigated 
using the phoneme-monitoring task devised by Foss (59). The technique rests on the 
assumption that all aspects of perceptual analysis are carried out to some extent by 
the same central processor; hence the time taken to recognize a target phoneme 
provides a useful index of the load placed on the processor by the syntax of a 
sentence. The general assumption appeared to be justified by the finding that re- 
sponses were slower when a target followed a relatively infrequent word, although 
the effect only occurs with adjectives (24). One salient cue to underlying structure 
is the presence of relative pronouns in embedded sentences. For example, “The car 
that the man whom the dog bit drove crashed” seems easier to understand than 
“The car the man the dog bit drove crashed.” Despite an early setback (62), Foss 
and Hakes and their co-workers have established that responses to a target phoneme 
in an embedded sentence are faster when it contains relative pronouns (77, 78). 
Hakes (76) has demonstrated in the same fashion that signaling the occurrence of 
a complement by the presence of “that” makes a sentence easier to parse. 

The effects of lexical cues to underlying structure have been examined only with 
verbs, perhaps because they differ so markedly in the number and variety of struc- 
tures they may take as objects. Verbs that can take complements should cause more 
difficulties than verbs that can take only simple direct objects, since whenever a 
complement verb occurs a greater number of structural hypotheses have to be tested 
(56). However, Hakes (75) failed to detect any effect of verb complexity upon a 
phoneme-monitoring task. A possible explanation for this failure is suggested by the 
results of an experiment carried out by Holmes & Forster (85). They used a tech- 
nique invented by Forster (57) in which the words of a sentence are superimposed 
one after the other in a rapid serial visual presentation. Under this regime, the 
perception of sentences was influenced by the sort of verb they contained. Sentences 
with simple transitive verbs were easier to perceive accurately than sentences with 
nounphrase complement verbs, and when such a verb was followed by an actual 
complement, e.g. “Bill wanted his father to leave on the last bus,” the sentence was 
still harder to perceive. A very different pattern of results emerged with verbphrase 
complement verbs. The presence of such a verb, e.g. “The judge warned the youth 
about the dangers of drugs,” made a sentence more difficult to perceive, but this 
difficulty vanished completely when such a verb was followed by an actual comple- 
ment, e.g. “His mother allowed the boy to swim in the pool.” This surprising 
facilitation is probably due to the fact that verbphrase complements are much more 
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constrained than nounphrase complements: they must contain an explicit subject, 
which is almost invariably animate. Since Hakes (75) had used an identical sentence 
frame for presenting both the simple and the complement verbs, his technique may 
have been insensitive to such subtle differences between them. This explanation may 
also account for a recent failure to find an effect of verb complexity on judgments 
of grammaticality (71). 


The Unreality of Deep Structure 


For obvious historical reasons most psycholinguists have maintained a continued 
intellectual loyalty to the basic tenets of transformational grammar. Yet if Chomsky 
and his colleagues had not postulated the existence of deep structure, psycholin- 
guists might never have invented it. The studies of click location show that clicks 
migrate sometimes to the boundaries between underlying clauses and sometimes to 
boundaries between major surface constituents. A simple explanation for this appar- 
ent inconsistency is that the technique may be primarily sensitive, not to the assign- 
ment of syntax, but to the amalgamation of meaning. The bulk of semantic 
processing obviously takes place at the end of clauses and at the end of complicated 
nounphrases. Just such nounplrases attracted clicks in the study carried out by 
Chapin and his colleagues (30). Similarly, the effects of different sorts of verb on 
rapid serial visual perception may be attributable to semantic factors: verbphrase 
complements almost invariably have animate agents, but there is no comparable 
prediction for nounphrase complements. 

This tentative conjecture is not a covert plea for a particular conception of 
linguistics, but is an argument for abandoning the idea that the deep structure of 
a sentence is separately perceived. It suggests instead that there is a single integrated 
analysis of syntax aimed at determining the meaning of a sentence. A listener 
attempts to assimilate the meaning of each incoming word to his current semantic 
representation of the sentence, or indeed to his representation of the discourse as 
a whole. He may well rely upon his knowledge of the likely meaning of a sentence 
and a variety of heuristic parsing strategies. Perhaps these two activities are con- 
ducted in parallel (cf. 14). But, if they are both geared to a semantic goal, it seems 
more likely that they will be conducted in serial fashion in order to avoid conflicts 
between them. Studies carried out by Rosenberg and his colleagues (e.g. 151) have 
indeed shown that general semantic expectations affect the intelligibility of sentences 
presented in noise. Forster & Ryder (58) have observed comparable effects upon 
rapid serial visual perception. Just as sentences with a complicated syntax were 
harder to perceive accurately than sentences with a simple syntax, so also sentences 
describing improbable situations were harder to perceive accurately than sentences 
describing mundane situations. However, when complex syntax was combined with 
improbable content, the resulting sentence was still harder to perceive, compatible 
with the view that hypotheses about sense and syntax are pursued serially. 

To deny the perceptual reality of deep structure is tantamount to denying the 
reality of surface structure, since some admixture of both sorts of information must 
be involved in syntactic processing. In what situation might it be shown that 
individuals are sensitive to unadulterated surface structure? One answer is, of 
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course, whenever there is enough time for its mental derivation. There are a number 
of studies that amply fulfil this requirement, yet which have failed to establish its 
psychological reality (cf. 111). Edwin Martin (118) found that when individuals 
were asked to sort the words of a sentence into “natural groups,” their performance 
did not reflect the surface structure of the sentence. A comparable failure occurred 
in another study (120) in which subjects indicated the locations of intonation breaks 
in sentences. An interesting finding occurred in both of these studies: when the 
object of a sentence was a complicated nounphrase, it was often separated from the 
verb. Once again, it is possible that performance reflected semantic rather than 
syntactic processing. 


THE COMPREHENSION OF WORDS 


The comprehension of words is a prerequisite to the comprehension of sentences; 
and the meanings of words must be represented in memory in a way which is 
compatible with the meanings of sentences. Yet such is the profusion of hypotheses 
about meaning (e.g. 103, 157, 165), that thinkers of a positivistic persuasion might 
be tempted to argue that the notion is gratuitous. This position is advocated by 
certain contemporary philosophers (cf. 44), but it is unlikely to find much support 
among psycholinguists, though the drift of a recent work is at times in this direction 
(133). The more dominant tendency is to search for a parsimonious theory of the 
representation of meaning. 


The Organization of Semantic Memory 


The representation of a word in a mental dictionary must include a variety of 
information—its sound and orthography, its syntactic properties, its meanings and 
any relevant perceptual features, its emotional and factual connotations. The list is 
long, and granted the efficiency with which the information can be retrieved, the 
system must be highly organized (127). The heart of the problem for our purposes 
is the organization and retrieval of semantic information. But before the meaning 
of a word can be retrieved, it is first necessary to identify the word. It would take 
us too far afield to consider the vast literature on this process; and we note in passing 
only that the effect of even an apparently simple variable such as frequency of usage 
is a matter of considerable controversy (e.g. 29, 65, 108, 131). 

Empirical studies of semantic memory are currently concerned with establishing 
the relative ascendancy of the psychological notion of an association, the linguistic 
notion of a semantic component, and the programming notion of a list structure. 
The traditional conception of an association has been reanimated by the work of 
Quillian (e.g. 147) in developing computer programs for the storage and retrieval 
of verbal information. Although this work was originally conceived as an exercise 
in artificial intelligence, Collins & Quillian (37, 40, 41) have used it as the basis for 
a model of human performance. They assume that the meaning of a word is its set 
of verbal associations, which may involve a variety of different sorts of associative 
link, including class-inclusion and part-whole relations. The organization of the 
complete associative network reflects the way in which information is initially 
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acquired. An individual is unlikely to learn as a separate fact that, for example, 
poodles are animals; rather he learns that dogs are animals and, independently, that 
poodles are dogs. Hence, the representation of “poodle” is only indirectly linked to 
the representation of “animal”; and once the network has been entered, the retrieval 
of certain facts is direct (e.g. “dogs are animals”), whereas the retrieval of other facts 
requires additional associative links to be traversed (e.g. “poodles are animals”). The 
results of several experiments suggest that the network can be entered with equal 
ease at any point (66, 113, 114), and they support the notion of a hierarchical 
retrieval of facts (37, 39, 40, 161). One unexpected finding was that although the 
closeness of items in the semantic network facilitated positive judgments about 
them, it impeded negative judgments, e.g. it is harder to evaluate “a canary is an 
ostrich” than to evaluate “a canary is a fish.” The difference has been consistently 
replicated (155, 156, 182) and perhaps constitutes the central regularity of semantic 
judgment: the greater the similarity in meaning between words, the easter it is to 
make a positive judgment, and the harder it is to make a negative judgment, about 
a semantic relation between them. The network model provides a natural account 
for the facilitation of positive judgment, but it is necessary to suppose that negative 
judgments are impeded by extraneous links between neighboring words (40). 

The regularity is, however, open to alternative explanations. Schaeffer & Wallace 
(156) argue that it is a consequence of a comparison process rather than a retrieval 
process. The meaning of a word, according to this theory, is represented by a set 
of conceptual elements analogous to the semantic components of linguistic theory 
(101). In deciding whether an item is a member of a category, the sets of conceptual 
elements for both words are retrieved and compared. The comparison involves 
sampling a certain amount of information from both sets in order to satisfy a 
criterion of judgment. Since similar words will have some common elements, the 
sampling required to satisfy the criterion will be smaller for a positive judgment than 
for a negative judgment. A still simpler explanation of the facilitation of positive 
judgments is that an individual has to search through a list of items stored with the 
category label, and so the larger the size of the category, the longer it will take him. 
Such category size effects were discovered by Landauer & Freedman (106), and 
subsequently corroborated by others (e.g. 122). They may explain what appears to 
be an effect of an associative hierarchy simply because position in this supposed 
hierarchy tends to be confounded with category size. It may take longer to decide 
that a poodle is an animal than to decide that it is a dog because there are more 
animals than dogs to be examined. 

Perhaps such a result simply reflects Marbe’s classical law of associations. The 
law states that the more frequent the associative response to a given word, the faster 
it should be made; and presumably “poodle” is more likely to elicit “dog” than 
“animal.” The existence of associative norms for categories (e.g. 9, 115) has led to 
direct tests of the law, and it has been emphatically sustained in studies of both the 
production of category members (66, 113, 116) and the categorization of given items 
(182). But what happens when associative frequency is controlled? A study by E. 
E. Smith and his colleagues (161) paired arbitrary digits with a hierarchy of catego- 
ries and still detected hierarchical effects. Comparable results were obtained in other 
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more naturalistic studies conducted by Carol Conrad. She found that when associa- 
tive frequency was held constant, hierarchical effects were present for statements 
about class inclusion but absent for statements about the properties of items (42). 
There seems to be more than Marbe’s law at work in categorization tasks; and of 
course it could never really explain the phenomena since it predicts that any item 
with a high associative frequency should be given as an instance of a category. The 
law is only a statement of a correlation between frequency and latency, and it cannot 
account for adherence to the logic of categorization. This failure brings us back to 
the three main competing approaches to semantic judgment. The attempts to tease 
them apart have been curiously unsatisfactory. No one theory has been conclusively 
vindicated, and they have all been confronted with some ugly facts. 

Perhaps the major inconvenience for the hierarchical theory is a finding reported 
by Meyer & Ellis (cited in 107). They examined the times taken to categorize a 
variety of items including pronounceable strings of letters that were not actual words 
in the language. It took longer to decide that an item like “‘mafer” was not a 
“structure” than to decide that it was not a “building.” Because “mafer” is not a 
word, it cannot be part of a semantic network, and the hierarchical theory cannot 
explain the result. The category size hypothesis, however, runs into comparable 
troubles. Collins & Quillian (38) investigated negative judgments involving items 
totally dissimilar in meaning and categories of different sizes. They found that it was 
just as easy to decide that an item such as “magnesium” was not an “animal” as 
to decide that it was not a “dog.” There were equally disquieting results with some 
positive judgments, e.g. there was no difference between categorizing birds and 
categorizing animals. Finally, Landauer & Meyer (107) have cast doubt upon the 
main regularity of negative judgments, and consequently upon both the hierarchical 
theory and conceptual element theory. They obtained rankings of the closeness of 
meaning of items involved in a categorization task. Negative judgments were 
not appreciably affected by the degree of semantic similarity between item and 
category. 

An empirically minded critic is likely to argue about the adequacy of these 
experiments and to try to draw up a balance sheet assessing their respective merits. 
Certainly there is a need for better data. No study appears to have utilized an 
entirely proper procedure for estimating category size, i.e. the number of items 
individual subjects actually recognize as members of a category. Yet a disinterested 
observer might draw a different moral. The seemingly contrary findings may reflect 
the flexibility of human semantic processing and the capability of subjects to adopt 
different strategies to meet the constraints of experimental situations. The experi- 
ments may even pass by the fundamental problem, since it is so rare in everyday 
life to have to ascertain the truth of a sentence such as “a dog is an animal.” Of 
course, one can hardly understand such a sentence without verifying it. But there 
is a considerable distinction between these two transactions for ordinary contingent 
sentences like “a dog is expensive to keep” (cf. 152). In understanding this sentence, 
an individual presumably retrieves some information from the lexical entry for 
“dog,” but what? It seems that for the most important semantic retrieval problem, 
we have no clear idea of what is retrieved, let alone how it is retrieved. 
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The Meaning of Words 


It would be dangerous, even if it were possible, to decide between the competing 
theories of semantic memory solely upon the results obtained from studies of catego- 
rization. Other experimental tasks have, of course, been used to investigate lexical 
memory. They range from studies of judgments about whether or not strings of 
letters are words (e.g. 124, 153, 163) to procedures involving memory for words (e.g. 
5, 74, 178). But these studies have been concerned mainly with matters other than 
a systematic assay of the alternative theories of semantic memory. From this point 
of view it is particularly unfortunate that the categorization studies have concen- 
trated on the organization of nouns, since in them the distinction between linguistic 
and factual knowledge is especially blurred. On what basis, for instance, do we 
assign things to the category of “birds”? Are the criteria linguistic or factual? 
Indeed, perhaps there are no criteria, but merely a list of exemplars from whose 
shifting attributes a disjunctive set of family resemblances may be inferred. Yet some 
characteristics are more salient than others. Eleanor Heider (81) has shown that 
apparently there are different degrees of category membership. Her subjects consid- 
ered that, for example, robins and eagles come closer to the ideal bird than do 
chickens and ducks. And, as the linguist George Lakoff has pointed out, we appear 
to recognize a variety of different sorts of criteria whenever we hedge our remarks 
in such characteristic ways as: "A robin is a typical bird”; “Strictly speaking, an 
ostrich is a bird”; “Superman is a regular bird.” 

Many of these complexities melt away when one turns to verbs or to relational 
terms in general. Apart from technical vocabulary, their meanings rarely embrace 
a fuzzy boundary between linguistic and encyclopedic knowledge, but the relations 
between them are correspondingly more complex. Simple hierarchies give way to 
intricate cross-classifications; for example, such verbs as “travel,” “own,” and “see” 
lack a causal component that is present in the meaning of “lift,” “seize,” and “look 
at,” and these verbs in turn lack an intentional component that is present in “chase,” 
“buy,” and “look for.” An alternative and more salient classification of these exam- 
ples, which are taken from an unpublished analysis by George Miller and the present 
author, is simply into verbs of motion, verbs of possession, and verbs of perception. 
It is unlikely that mere lists of verbs in semantic memory could handle these 
subtleties. And since the network theory has no way of freeing meaning from the 
specific verbiage in which it is expressed, it seems committed to the dubious view 
that the meanings of verbs are acquired, in part, from explicit statements involving 
“cause” and “intention.” Only a theory countenancing underlying conceptual ele- 
ments seems to offer sufficient flexibility to account for the acquisition of meaning. 
Apart from such global considerations, however, it is difficult to distinguish between 
a semantic network and a set of conceptual elements. Studies of semantic judgment 
often yield evidence compatible with both theories. For instance, Steinberg (e.g. 164) 
found that sentences like “The husband is a wife” are considered to be contradic- 
tory, whereas sentences like “The chair is a husband” are considered to be nonsensi- 
cal. The difference can be explained either by postulating an ordered set of semantic 
components representing the meaning of a word or by reference to the likely dis- 
tance between words in the semantic network. 
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One factor that has undoubtedly shaped the direction of research into semantic 
memory, at least in a negative way, has been a lack of systematic knowledge about 
the meanings of words. It is difficult to study the organization and retrieval of 
information without a specification of what information is involved. Although a 
number of empirical techniques are helpful (e.g. 54, 136), there seems to be no 
substitute for painstaking linguistic analysis. Unfortunately, linguists have been 
reluctant to carry out such lexicographical projects in the detail required by psychol- 
ogists. Perhaps the sort of approach that will have to be adopted is exemplified by 
George Miller’s recent study of verbs of motion (128). Miller used a novel method 
of “incomplete definition” in which his intuitive choice of paraphrases revealed the 
major semantic elements of each verb. The resulting analysis was tested empirically 
at various points by using a technique in which a subject has to sort words according 
to his perception of their similarity in meaning. This technique has been developed 
over a number of years for use with both adults and children (e.g. 3, 126), since it 
provides a stringent test of semantic analyses. Once a beachhead into the English 
lexicon has been established in this way, it may be possible to devise suitable 
discovery procedures, 


THE COMPREHENSION OF SENTENCES 


A major obstacle to understanding the process of comprehension is its intangible 
nature—-a fact borne out by the possible semantic contamination of studies of 
syntactic perception. It can, of course, be investigated by the use of such tasks as 
verification or paraphrase, since—to use a potent metaphor—a program must be 
compiled before it can be executed (45). But the way in which comprehension occurs 
may well depend upon which of these tasks is about to be performed. Perhaps the 
simplest and oldest technique is merely to time how long it takes an individual to 
decide that he has understood a sentence. It was with this procedure, for example, 
that Danks (43) showed that comprehension appears to be disrupted more by 
semantic anomaly than by grammatical deviance, and that the effort after meaning 
sometimes led subjects to misread lexical items. It is probably important whether 
the subjects read or listen to the material. A reader, unlike a listener, can adjust the 
depth of his processing to take into account fluctuations in the difficulty of a passage 
(46). Thus readers can make rapid and accurate assessments of the comprehensibil- 
ity of a sentence, sometimes even before they have reached the end of it (158). A 
listener can, of course, elect not to pay too much attention to what someone is 
saying, but there is clearly no aural equivalent to rapid reading—it is speech that 
has to be speeded up to increase the efficiency of the system (63). 

There is a direct parallel between judgments of comprehensibility and judgments 
of grammaticality; both sorts of judgment appear to be adversely influenced by 
semantic incongruities between noun and verb (e.g. 130, 170). The crucial syntactic 
factor appears to be simply the distance between subject and verb; hence sentences 
that are self-embedded, or that have a considerable Yngve depth, tend to be rated 
as hard to understand (79, 171). It is difficult to establish whether the nouns are 
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more salient than the verb, or the verb is more salient than the nouns. But the verb 
does appear to be decisive for the control of the interpretative process; and when 
sentences are grouped together on the basis of their similarities in meaning, same- 
ness of verb rather than sameness of subject is the potent factor (cf. 80). 


Understanding Ambiguous Sentences 


Most sentences considered in isolation are to some extent ambiguous; but it is only 
rarely that listeners are perplexed by ambiguity. In some mysterious way, the 
context of an utterance usually suffices to render its meaning plain. However, 
although only a single conscious interpretation of a sentence occurs, other interpre- 
tations may have been covertly considered and rejected. Certain results support such 
a conjecture; for example, Foss (60) found that responses in his phoneme-monitor- 
ing task were slower to ambiguous than to unambiguous sentences, and Garrett (67) 
found that the location of clicks within ambiguous words was less accurate than 
when they had been rendered unambiguous by the preceding context. But perhaps 
the most striking evidence for the partial processing of more than one meaning 
comes from a study by Lackner & Garrett (105). They presented an ambiguous 
sentence to one ear with a disambiguating sentence 5 dB softer to the other ear. 
Almost without exception the sentence was interpreted in accordance with its 
disambiguating context, yet the subjects were unable to report this unattended 
sentence. In order for the context to have such an effect, it seems that both meanings 
of the sentences must be to some extent available to the subjects. All of these findings 
lend support to McKay’s theory (117) that the meanings of ambiguous sentences 
are processed in parallel and interact with one another in such a way that the 
perception of one meaning leads to the suppression of others. 

Failures to detect effects of ambiguity have been reported in a number of studies. 
It seems to take no longer to paraphrase ambiguous sentences than unambiguous 
sentences (67); and it only takes longer to verify ambiguous descriptions on those 
occasions that the ambiguity is likely to have been noticed (27). 

These superficially inconsistent findings have been resolved by Garrett (67) on the 
grounds that the experiments showing no effects of ambiguity tested subjects after 
they had understood the sentences, whereas experiments showing effects of am- 
biguity tested them during the process of comprehension. The one awkward phe- 
nomenon is the greater time required to complete an ambiguous rather than an 
unambiguous sentence fragment (117). However, a recent study (17) suggests that 
the effect appears only when the fragment ends in an incomplete clause. In other 
words, ambiguity is irrelevant once a clause boundary is passed, because a single 
meaning has generally been decided upon by then. There is, however, an alternative 
resolution of the experimental findings. An individual is likely to analyze more than 
one meaning of a sentence only if its context fails to provide a strong bias towards 
one reading. If there is such a bias, the sentence will be no harder to understand 
than an unambiguous one. If the bias turns out to be inappropriate, then the listener 
will have been led up the garden path and he will have to reconsider the sentence 
(cf. 47). It follows that where sentences are presented in contextual isolation, effects 
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of ambiguity are likely to be detected; but whene they are presented after a disam- 
biguating context, no such effects are likely to be detected. The one awkward 
exception to this principle is the failure to find any difference in the time to para- 
phrase isolated ambiguous and unambiguous semtences. But although paraphrasing 
tasks are often revealing about the search for meaning (51, 68), they are relatively 
insensitive indices of perceptual processing (75). It seems, in general, that any 
procedure permitting sentences to be considered in tranquillity is unlikely to be 
affected by their ambiguity (129). 

Students of ambiguity have been provided with a wide variety of materials by 
linguists, and they have investigated the effects. of ambiguities in both words and 
syntax. It does not seem that there are any safe generalizations that can be made 
at present about the psychological relevance cf the origin of an ambiguity in a 
sentence. However, the view that all the meanings of a sentence are analyzed to some 
extent seems more feasible when its ambiguity .s structural rather than lexical in 
origin; certainly such a distinction would explein why it is harder to reinterpret 
sentences with lexical ambiguities (cf. the study. by Cairns cited in 17). 


The Verification of Sentences 


The study of sentence verification has undergonea distinct shift of emphasis during 
the last few years. Originally the task was introduzed in order to establish differences 
in the ease of understanding sentences, as in Wason’s classic studies of negation (e.g. 
172). But psycholinguists have become increasingly interested in how an individual 
determines whether a sentence is true or false. This development is due largely to 
the independent work of Clark (32, 36) and Trabasso (167, 168), although the 
differences between their approaches are largely terminological. Both theories agree 
that the process of verification can be separatec into four main serial stages: the 
representation of a sentence, the representation of some state of affairs it purports 
to describe, the comparison of the two representations, and the mobilization of an 
appropriate response. The order of the first two stages depends upon the order of 
presentation of the materials or the order in whica they are encoded. But this order 
has important consequences. If a visual display is presented first, it will usually be 
represented in a neutral affirmative way. It it is presented after its purported descrip- 
tion, then the format of its representation will be contingent upon the nature of this 
sentence. 

Verification tasks have shown that a variety af factors appear to influence the 
initial representation of a sentence. There is evidence that negative sentences take 
longer to understand than affirmative sentences (32, 168), that explicit negatives 
such as “none” or “not present” take longer to understand than implicit negatives 
such as “few” or “absent” (32, 99), and that passive sentences take longer to 
understand than active sentences (135). But the analysis of verification has also 
stimulated interest in other linguistic phenomena, including the distinction between 
unmarked items, such as “deep” or “long,” whick can be used in both a neutral and 
a contrastive sense, and marked items, such as “snallow” or “short,” which can be 
used only in a contrastive sense. Marked items resemble negatives in that they tend 
to take longer to encode. However, there has >een sharp controversy between 
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Huttenlocher (e.g. 87, 88) and Clark (e.g. 33) about lexical marking and the strate- 
gies individuals use to solve three-term series problems. The controversy has been 
fervent rather than enlightening, perhaps because the antagonistic positions may be 
reconcilable (96). 

Although Clark (e.g. 35) originally argued that sentences are represented in the 
form of their underlying structures, the evidence for this view is sketchy and indirect 
and takes no account of the alternative hypothesis that they are represented in a 
more semantic form. No verification study has directly contrasted these two views; 
but evidently Clark (34) has partially abandoned this tenet of his theory. 

There is also a latent controversy about possible differences in picture encoding. 
Chase & Clark (31) argue, for example, that whether a display is encoded as X above 
Y oras Y below X makes no difference to the duration of the operation. They found 
that instructing subjects to attend to the top or the bottom of the display had no 
effect on encoding time, though it obviously affected the form in which the display 
was represented. However, when Olson & Filby (135) ingeniously focused attention 
upon the depicted object of an action, encoding was in a form comparable to the 
passive voice and took longer to execute. It seems that verification like other choice 
reactions is influenced by information-theoretic aspects of the task, and the range 
of alternative stimuli may affect encoding time (134). 

The core of the verification models is, of course, the stage when the two represen- 
tations are compared. It is axiomatic in both models that a successful match depends 
upon complete congruency between representations; and this view naturally sug- 
gests an underlying abstract encoding common to both pictures and sentences. The 
challenge to the theorist is to devise a plausible set of operations that yields the 
appropriate responses and tallies with the data. Clark & Chase (36) argue that the 
comparison stage has a truth index which is initially set as “true,” and that each 
mismatch reverses this index. A series of comparisons is required because the 
negative component of a sentence is not considered until after its basic unnegated 
proposition. The truth index obviously remains unchanged for a true affirmative. It 
is reversed once for a false affirmative because the representation of the sentence fails 
to match the representation of the picture. It is reversed once for a false negative 
because although its unnegated proposition matches the representation of the pic- 
ture, its negative component does not. And it is reversed twice in the case of a true 
negative because first the unnegated proposition and then the negative component 
fail to match the representation of the picture. The resulting interaction between the 
form of the sentence and its truth value has been abundantly confirmed: true 
affirmatives are easier than false affirmatives, but true negatives are harder than false 
negatives (e.g, 32, 168). In some experiments, however, subjects rid themselves of 
negative sentences by converting them into equivalent affirmatives, e.g. the predicate 
“isn’t odd” is changed to “is even.” Naturally, such conversions are particularly 
prevalent with binary predicates (168). When subjects adopt such a strategy, their 
results are compatible with the models on the assumption that conversion modifies 
the representation of a sentence and takes an additional amount of time. 

Contrary to the central assumption of the verification models, Tversky (169) has 
suggested that a name may be converted into a pictorial code or a picture into a 
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verbal code; and Seymour (159) has similarly ergued that the meaning of a word 
in a shape-naming task may be represented in some quasipictorial fashion. Such 
arguments may be correct for simple tasks where subjects are sensitive to the 
restricted nature of the linguistic materials. But if negative sentences were encoded 
pictorially, it would be extremely difficult to explain the standard interaction with 
truth value. Perhaps the real burden of these crizicisms is that the boundary condi- 
tions of the verification models have yet to be clearly drawn. The majority of studies 
have been concerned with seeking confirmatory evidence, and critical tests of the 
serial nature of verification, using the additive factor method (166), have only 
recently been undertaken. Unfortunately, it does not seem too difficult to discover 
embarrassing interactions between factors, e.g. ercoding a negative sentence appears 
to retard both picture encoding (104) and the retrieval of information from long- 
term memory (123). 


Answering Questions 


Answering a question is in many ways a comparable task to verifying a statement, 
especially when the interrogated information has only just been presented. Hence 
it is a simple matter in principle to modify the verification models so that they will 
account for this sort of question answering (35). But the actual details of such 
modifications are harder to establish since the experimental results are complicated. 
A surprising phenomenon was discovered by Patricia Wright (185) in a task where 
listeners heard a sentence followed 5 seconds later by a question about it. The main 
source of difficulty in giving a correct answer about the actor or object was a 
mismatch between the voice of the question and the voice of the original sentence. 
Consequently, passive sentences created difficul-y only when the questions were 
active. When the action itself was queried-—e.g. “what was done by X?”—the 
syntactic form of the question was much less re evant than which noun occurred 
in it. The task was easier when this noun was the frst to be mentioned in the original 
sentence. It might seem that these effects are simply due to the verbatim retention 
of the sentences. In a subsequent replication, however, Wright (186) used an inter- 
polated task to prevent rehearsal, but the pattern of results was identical. A further 
experiment suggested that even when the answer to a question is a single word, its 
production covertly involves a complete clause, and that the form of the question 
determines the form of this clause. This principle zpplies even where the action itself 
is queried: the noun that occurs in the question becomes the first noun of the 
answer. 

There are still more complex results from stucies of question answering. Some 
experiments conducted by Smith & McMahon (1€2) detected direct effects of voice: 
where the sentences and questions concerned the relative order of two participants, 
passive sentences took longer to encode than active sentences. There were also 
striking effects of both intricate semantic variables and the order of presentation of 
sentence and question. What makes such results particularly difficult to interpret is, 
as the authors acknowledge, the possibility that subjects adopted special strategies 
to deal with the task. 
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The Natural History of Negation 


The main danger with the models of verification is that their very success in certain 
situations may lead to unjustified claims, such as that the comprehension of all forms 
of negation can be accounted for by a single model. The example of negation is 
instructive because Wason has argued for a number of years that its proper function 
is to deny misconceptions and misunderstandings. The reason that negative sen- 
tences are not ordinarily noted for their difficulty is perhaps because their interpreta- 
tion requires the listener merely to consider changing the truth value of some 
proposition that he has already interpreted (174). Indeed, the ordinary negative 
seems analogous to the false negative of the laboratory, since it probably calls for 
only a single mental reversal of truth value (173). In evaluating a similar thesis, 
Greene (72) found that it was much easier to determine that an affirmative and a 
negative sentence differed in meaning than to determine that they were synonymous. 
It is also easier to appreciate that an assertion has been denied when the denial is 
an explicit negation rather than an implicit negation (98). 

One tentative moral that might be drawn from this aspect of negation is that there 
are good psychological reasons for the existence of each sort of grammatical clause. 
If there are circumstances where negatives are easier than affirmatives, then there 
are likely to be favorable circumstances for any option within a clause. There may 
be no intrinsically difficult clause constructions, only difficult combinations of them. 


LANGUAGE AND MEMORY 


It is parsimonious to assume that sentences are remembered, at least initially, in the 
form in which they are interpreted; and such an assumption is often made, as 
Fillenbaum (53) has emphasized, when making inferences about comprehension 
from the results of memory experiments. But since these results so often depend 
upon the task the subjects have performed (26, 50, 187), there is uncertainty about 
whether the experimental procedure affects initial interpretation or subsequent me- 
morial processes. Nevertheless, it is generally accepted that sentences are not nor- 
mally retained as strings of unrelated words, and a variety of hypotheses have been 
put forward about their mode of organization in memory. 


Memory and Syntax 


A simple proposal is that sentences tend to be retained in “chunks” that correspond 
to their surface constituents (e.g. 94). Although effects of surface structure have been 
reported in the rote learning of connected discourse (4), their natural locus appears 
to be the short-term retention of sentences (82). When an individual is presented 
with a sentence spoken in normal intonation followed by a probe word to which he 
has to respond with the next word in the sentence, the latency of his response 
appears to reflect the surface structure of the sentence (1). Since pauses in linguistic 
and nonlinguistic materials have a comparable effect on the probe task (180, 181), 
the apparent effects of surface structure may be due to rhythmic aspects of the 
perceived or rehearsed intonation of the sentence (cf. 120, 121). 
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We saw previously that certain aspects of syntax affect comprehension, and 
several theorists have proposed that such features will also affect retention. A 
diversity of syntactic indices have been canvassed, including transformational com- 
plexity, Yngve depth, and lexical density (i.e. the proportion of content words in a 
sentence). Unfortunately, while Perfetti (142) found an effect of lexical density but 
no effect of depth in short-term retention, Backarach & Kellas (7) holding lexical 
density and depth constant found an effect of transformational complexity, and, in 
more long-term tasks, Wearing (e.g. 176) found an effect of depth but little or no 
effect of transformational complexity. There is not likely to be any simple resolution 
of these results. It is difficult to see why any syntactic property should ordinarily 
be relevant to the retention of intelligible senteaces, unless it is being deliberately 
used as an aid to memory. Paradoxically, unraveling the mechanisms of intentional 
memory could prove extremely difficult simply because of an enhanced sensitivity 
of subjects to the nature of the materials. 

The obvious alternative to some sort of superficial segmentation is, of course, a 
deep structure representation of sentences. But since our earlier analysis of percep- 
tion suggested that deep structure had no independent psychological validity, it 
should not be surprising to learn that recent werk continues to cast doubt on this 
hypothesis. It was originally argued, for example, that the amount of storage space 
occupied by a sentence depended on the number of transformational “footnotes” 
such as Passive, Interrogative, and Emphatic specified in its underlying structure. 
This view has encountered a whole series of setbacks (e.g. 49, 61, 69). Deep structure 
theories about the efficacy of words as prompts to the recall of sentences have 
perhaps fared a little better, but they have rot gone unchallenged. Certainly 
prompted recall can be affected by factors othe- than syntax. It is enhanced, for 
instance, by the presence of co-referential items m separate clauses (110), provided 
that the relation is conspicuously signposted by thz use of a pronoun. The facilitation 
is not simply due to the greater ease of processing pronouns (119). On the contrary, 
children’s memory is improved by explicit repe’itions of nounphrases within the 
same sentence (100), presumably because they hzve yet to master the intricacies of 
pronominalization. The integrative function of pronouns is further corroborated by 
the observation that when a sentence is partially recalled, the presence of a pronoun 
rather than a noun is more likely to lead to its complete recall (10). 

Perhaps the best illustration of the drift away from explanations in terms of deep 
structure is provided by the work of Rohrman and his colleagues on memory for 
nominalizations. Linguistic analysis shows that a subject nominalization such as 
“growling lions” has a simpler underlying structure than an object nominalization 
such as “digging holes,” because the object nom:nalization requires an additional 
argument representing its deleted subject. Rohrrnan’s original view (cf. 149) was 
that this difference explained why subject nominaizations were easier to remember 
than object nominalizations. However, it was evident in the experimental materials 
that the subject nominalizations generally involved animate nouns and intransitive 
verbs, whereas the object nominalizations generaly involved inanimate nouns and 
transitive verbs. Subsequent tests showed that animate nouns were easier to remem- 
ber than inanimate nouns, and that intransitive ve-bs were easier to remember than 
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transitive verbs (144, 149). Rohrman concluded, with some justice (8, 150), that the 
critical factor was the lexical representation of nouns and verbs. 

Just when it looked as though a plausible account of linguistic memory might be 
given in terms of undifferentiated syntactic factors on the one hand and semantic 
factors on the other hand, a quite separate conception of memory emerged. 


Memory and Imagery 


In a number of publications (e.g. 137, 138), Paivio has argued that imagery is a 
highly potent mode of representing sentences and other verbal materials. According 
to this hypothesis, certain so-called “concrete” sentences are represented in the form 
of sensory images, whereas other so-called “abstract” sentences are represented less 
efficiently in some form of verbal code. Particular situations are likely to inhibit the 
use of imagery (6), and particular individuals are likely to be incapable of forming 
sensory images (141). Nevertheless, imageability seems to be an important variable. 
Two independent studies, for example, have shown that Rohrman’s subject nomi- 
nalizations were rated as easier to image than his object nominalizations (139, 175); 
and when imageability was carefully manipulated, it was the only variable to have 
a consistent effect upon recall. Yet the interaction between imageability and type of 
nominalization, with its varying pattern from one experiment to the next, suggests 
that unknown factors are also at work. 

There have been several other reappraisals of established findings in the light of 
the imagery theory. It was generally considered that the subjects of sentences are 
easier to recall than their objects, and that they differ in their efficacy as prompts 
in the same way (86). However, when imageability was controlled, these differences 
disappeared (89). Likewise, earlier studies had shown that the nouns in adjective- 
noun pairs tend to be better recalled than the adjectives (86), and that they differ 
in their efficacy as prompts in the same way (112). But when their imageability was 
equalized, the asymmetry in recall disappeared (12, 143). 

It seems incontrovertible that sentences can be remembered in the form of images. 
But although such a view is often pitted against alternative linguistic explanations 
of experimental findings, it is not necessarily in fundamental conflict with them. 
When imagery is used, sentences have to be translated into images, and at recall the 
images have to be translated back again into sentences. The probability of error 
during these processes, as Paivio (139, 140) admits, may well be related to the 
complexity of the material to be remembered. Certainly linguistic factors seem to 
be necessary to account for the finding that the imagery value of the object noun 
is a better predictor of verb recall than the imagery value of the subject noun (89). 
And if the differences in rated imageability could be explained, they might depend 
upon linguistic complexity as well as more obvious perceptual attributes (102). 

The task for the Imagist psychologists is to demonstrate conclusively that con- 
crete sentences are encoded as sensory images, and that it is in virtue of this 
representation that they are better remembered than abstract sentences. Begg & 
Paivio (13) found that subjects were sensitive to changes in the meaning of concrete 
sentences, and merely sensitive to changes in the wording of abstract sentences. Such 
differences are consistent with the theory but inconclusive. A recent study has 
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demonstrated that the abstract sentences were harder to understand and that 
changes in their meaning were correspondingly harder to detect (92). Hence, the 
most that can be safely concluded from the original experiment is that individuals 
retain the sense of sentences, as opposed to verbiage, in proportion to their grasp 
of it. More direct tests of encoding have also sometimes yielded results contrary to 
the imagery theory (e.g. 179). It seems incontrovertible that sentences can be re- 
membered without recourse to imagery. 

One of the consequences of the revival of interest in imagery, and in other forms 
of mnemonic, has been an enormous increase in the use of sensible linguistic materi- 
als in studies of associative learning; and there is wide-ranging evidence for an 
improvement in such tasks as paired-associate learning when subjects are encour- 
aged to form a unifying image between stimulus and response (e.g. 19, 20). The study 
of sentences from an associative point of view—a topic too vast to be reviewed here 
—has revealed that in incidental learning tasks a sentence has a definite integrative 
function (2). Likewise, when subjects respond associatively to one word with an- 
other from the same sentence, the pattern of their responses is consistent with 
neither a surface nor a deep structure representation. It is best explained in terms 
of semantic processes (177). 


Memory and Meaning 


When subjects are unaware that their memory for a particular sentence is going to 
be tested, they rapidly lose the ability to recall its precise wording. In a series of 
studies, Jarvella (e.g. 90, 91) has investigated the ability of listeners to perform a 
running memory task in which from time to time they have to recall the immediately 
preceding part of a connected passage. Typically, memory is reasonably accurate 
only for the final sentence that was heard, and it is verbatim only for its final clause. 
Caplan (25) has shown that the latency to identify a probe word is faster if it, too, 
comes from the immediate clause rather than an earlier clause, regardless of the 
amount of interpolated material. Similarly, a study carried out by Goldman-Eisler 
(70) has established the importance of the clause in simultaneous translation. Such 
results bear out the thesis that verbatim information is normally lost to memory 
almost as soon as it has been used, and that the clause is the natural unit in such 
transactions. 

One of the difficulties in evaluating the importance of the clause in perception is 
to determine whether its boundary is associated with the assignment of underlying 
structure or with a more general process of semantic interpretation. Its importance 
in memory is similarly ambiguous since the retention of meaning is often sufficient 
to enable a subject to reconstruct underlying structure. However, where sense and 
syntax are not confounded, it appears that underlying grammatical relations are 
rapidly forgotton. Fillenbaum (53) found that subjects in an incidental memory task 
readily confused sentences of the form “If you do that DU hit you” with those of 
the form “Do that and DU hit you.” Another recent study obtained similar confu- 
sions between sentences of the form “John liked the painting and he bought it from 
the Duchess” and “The painting pleased John and the Duchess sold it to him” (97). 
The meaning of sentences in these studies and others (e.g. 11) was retained with 
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remarkable accuracy. And, as Rosenberg and his colleagues have shown, purely 
semantic manipulations can affect the memorability of sentences. Semantically well 
integrated sentences such as “The doctor cured the patient” are better recalled than 
less well integrated sentences such as “The doctor shook the author” in both 
intentional and incidental tasks (e.g. 151). 

Inference is likely to play an important part in comprehension, and it is from 
studies of memory that this fact first emerged in the psycholinguistic literature. It 
had been forced upon theorists by some of the problems of analyzing connected 
discourse. For example, Winograd (184) pointed out that there is no linguistic rule 
for relating anaphoric pronouns to their appropriate nounphrases. The listener relies 
upon a heuristic inference in order to appreciate the contrast between “Peter put 
the package on the table, but because it was round, it rolled off’ and “Peter put the 
package on the table, but because it wasn’t level, it slid off’ (cf. 132). The role of 
inference in memory has been most notably demonstrated by Bransford & Franks 
and their colleagues (21). They observed that subjects presented with a sentence like 
“Three turtles rested on a floating log, and a fish swam beneath them” would readily 
assume in a recognition test that the sentence had read “Three turtles rested on a 
floating log, and a fish swam beneath 2 "7 However, if the original sentence merely 
stated that the turtles were beside the log, such an inference is less plausible, and 
the subjects were much less likely to make the false recognition. Such inferences 
probably occur during the interpretation of a sentence; yet an unpublished study by 
Charles Jenkins (cited in 71) suggests, surprisingly, that they are made later. 


Memory and Connected Discourse 


It is debatable whether in the ordinary course of events individual sentences are 
remembered. It seems more likely that once their meaning has been grasped, they 
will be retained not as separate entities but by integration into the relevant stock 
of long-term memories. Bransford & Franks have shown that there appears to be 
a strong tendency for separately acquired information to become related in this 
fashion. Their subjects had considerable difficulty in distinguishing whether they 
had been originally presented with such sentences as “The ants ate the jelly” and 
“The jelly was on the table” or with more complex sentences such as “The ants ate 
the jelly on the table” (22, 64). A further study has established that such integrations 
occur in the spontaneous interpretation of connected discourse (93). 
Psychologists have been interested for a long time in the ways in which coherent 
discourse is remembered, since it is a problem of both practical and theoretical 
significance (cf. 28 for a number of reviews from both orientations). The main topic 
that has been investigated experimentally is the effect of thematic knowledge upon 
retention. One of the first of these studies, carried out by Pompi & Lachman (145), 
showed that where the theme of a text was obvious, subjects would readily assume 
that words related to it had occurred in the passage. Dooling & Lachman (48) went 
on to demonstrate that the recall of a somewhat vague passage was considerably 
enhanced when it was given an appropriate and revealing title. A similar study by 
Bransford & Johnson (23) replicated such findings and also showed that the compre- 
hension and retention of sentences could be improved by presenting a picture of the 
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general situation they described. Few psychologists would be surprised by this result 
because without the picture to set the scene the sentences hardly appear to relate 
to one another—the reader is distinctly reminded of the White Rabbit’s evidence 
in Alice in Wonderland. What is particularly interesting about such passages, and 
stands in need of further investigation, is the variety of linguistic devices that can 
be exploited to avoid having to make clear what is being discussed, e.g. the use of 
anaphoric pronouns in the absence of a prior specification of their reference, e.g. “It 
takes some skill but it’s easy to learn," the use of abstract or generic descriptions 
for specific activities, e.g. “Too many people doing the same thing can also cause 
problems,” and the deliberate omission of an underlying argument, e.g. “Rain, 
however, soaks in very fast.” (All of these examples come from a passage about 
flying a kite.) It seems plausible that such variables would have a similar effect upon 
the memory of individual sentences; and, indeed, Dooling & Lachman (48) found 
such effects even when the passage was presented in a scrambled order. 

The ultimate problem in the study of connected discourse is to discern what 
factors make for its maximum cohesion and how it is mapped into some form of 
memorial representation. We know that individuals tend to remember what is 
important (e.g. 95), and that they are sensitive to structural and cohesive factors in 
the text (e.g. 146), but we are almost totally ignorant about the processes involved 
in the long-term representation of discourse. We do not even know whether such 
linguistic memories entail different principles of organization from the episodic 
memories of daily life. 
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Wilbert J. MeKeachie! 
Department of Psychology, University of Michigan, Ann Arbor 


The first problem in reviewing research on instructional psychology is determin- 
ing what “instructional psychology” means. In this review we take it to mean those 
things relevant to teaching, educating, or instructing. The question we are trying to 
answer is “What factors influence the effectiveness of instruction?” Our structure 
is a relatively loose one in which we group together studies primarily dealing with: 
(a) the learner, including such variables as anxiety, aptitude, and social class (most 
of this section deals with attribute-treatment interactions); (5) the teacher and 
teacher style; (c) the teaching methods, technology, and characteristics of the class; 
(d) the objectives and content, including microfeatures of instruction such as feed- 
back and reinforcement. 

The two earlier reviews of instructional psychology in this series (Gagné & 
Rohwer 51, Glaser & Resnick 56) approached the subject in different ways, and this 
review obviously reflects my own interests and biases. As compared with the earlier 
chapters, this one probably gives greater weight to molar instructional variables and 
less to those variables traditionally associated with research in verbal learning and 
performance. This chapter supplements rather than supplants these excellent pre- 
ceding reviews of instructional psychology. Most of my research and experience 
have been in the area of college teaching, and the reader should be warned that 
research dealing with college students and adults has received disproportionate 
emphasis in this review. And because I have a great interest in improving teaching, 
I trust I may be forgiven for occasional paragraphs directed more to the teacher than 
to researchers and theorists. 


1I am indebted to Terry Miller, Nan Holmes, Margy Moffett, Yi-Guang Lin, Richard Mann, 
James Greeno, Stanford Erickson, Rob Seltzer, James Kulik, and Karl Zinn for helpful 
comments and to Bette DeKarske, who handled the onerous task of coordinating the bibliogra- 
phy with the text. 
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It is probably not pure chance that led me to start this chapter with a section on 
the student. The term “instructional psychology” is not one I like as well as “psy- 
chology of learning” or even “educational psychology,” for “instruction” carries a 
connotation of teacher direction or building that 1s less pleasing to me than the 
emphasis on the student implied in “learning,” or even the relationship of “leading 
from” implied in “educate.” Even though the student as a person gets lost in this 
chapter’s consideration of traits and attribute-treatment interactions, I still have 
some sense that the most important focus of instruction is a living, breathing, 
seeking, human being. Were I a sociologist I would no doubt begin with the broader 
social setting and the role of the school in society. Certainly much of the most 
important criticism of education in our society aims not at the specifics of instruction 
but at the bureaucratic nature of the school in inculcating values and knowledge and 
screening students for positions in our social and occupational heirarchies. Thus the 
author begs the reader to remind himself from time to time that “state and trait 
anxiety,” “Keller plan,” “inductive and deductive strategies,” and our other topics 
have to do with the feelings and experiences of young people in an institution which 
may or may not fit well with the rest of their lives. 


THE LEARNER 


Anxiety 


Learner anxiety has for some time been a major interest of educational researchers. 
In many situations anxiety seems to be detrimental to learning and/or to perfor- 
mance on tests, but the findings are inconsistent (see Martens 94), and the mecha- 
nisms by which anxiety may influence learning and performance have not been 
clarified despite conflicting theories. During the past 2 years some progress has been 
made in this area. 

Researchers at two centers of activity, the City University of New York and 
Florida State, have used variations in programmed materials as the instructional 
method. In a typical but rather complicated study, Tobias & Abramson (158) 
compared the interactions between stress, anxiety, and response mode. The basic 
variable in response mode was whether the program required only reading or a 
constructed response. An additional variable was reinforcement or nonreinforce- 
ment (i.e. whether or not the correct answer was given on the next frame of the 
program). Anxiety and stress did interact, but the more anxious students learned 
familiar material better than less anxious students in the stressful condition. Note, 
however, that this was a learning situation with a relatively low error rate (3.7%). 
The expected interaction in the opposite direction should be more likely to occur 
with more difficult materials, as was demonstrated in the experiments by O’Neil et 
al (116) and Spielberger, O’Neil & Hansen (147). 

Spielberger’s approach differentiates the state of anxiety from the trait of anxiety. 
The state of anxiety consists of unpleasant consciously perceived feelings of nervous- 
ness, tension, and apprehension with concomitant physiological arousal. The trait 
of anxiety is the tendency to perceive a wide variety of situations as threatening and 
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to respond with anxiety; i.e. trait anxiety is a relatively stable individual characteris- 
tic (Spielberger 146). Trait anxiety is the primary component of most tests of 
anxiety. 

The distinction between state anxiety and trait anxiety is an important one, since 
one is thereby led to look at changing effects of anxiety in a given situation. One 
might expect, for example, that anxiety, when it is aroused, would first be disruptive 
for learning difficult materials, but that the course of the effect of anxiety might then 
depend upon what sort of defense mechanisms were used. Spielberger et al (147) and 
O’Neil (115) present evidence that students with high trait anxiety show diminishing 
state anxiety over time in stress conditions. Since this trend did not occur for low 
trait anxiety students, it suggests that high trait anxiety students may have learned 
to defend against anxiety relatively quickly in stressful situations or that they at least 
have defended against admitting they’re anxious. 

Do these defenses take hold as soon as the student begins a test, or only when 
he encounters difficult items? Do the defenses break down if the test is too difficult? 
Testing experts have usually advised a test sequence moving from easy to difficult 
items on the assumption that easy items increase confidence and reduce the amount 
of damage to performance likely to be caused by high anxiety. Munz & Jacobs (110) 
found that when sequencing achievement tests items, a hard-to-easy order is better 
than a random order, and perhaps even better than an easy-to-hard order for 
students high in trait anxiety. One explanation would be that the highly anxious 
students feel a great sense of relief when they come to easy items after trying to solve 
hard ones; with an easy-to-hard sequence, on the other hand, anxiety is increasing 
and thus likely to be more disturbing. 

In programmed learning tasks it is difficult to separate the effects of anxiety upon 
learning and the effect on the responses required by the program. Moreover, there 
is the probability that being questioned arouses anxiety, particularly if one does not 
know the answer. The effects of anxiety on performance in a low error rate program 
may be quite different from effects on ordinary learning of text or upon performance 
on a difficult achievement test. Thus, for example, Ray, Katahn & Snyder (126) 
found no effect of trait anxiety on learning items of prose except when test items 
were interspersed. Anxiety not only affected original performance adversely but also 
affected retention with original learning controlled. 

Tennyson & Wooley (155) compared the performance of high and low A-State 
subjects on a self-instructional poetry classification program in which an easy sec- 
tion followed a difficult section of poetry selections. Subjects initially low in A-State 
made fewer errors on the difficult section, but high A-State subjects made fewer 
errors on the easy section. These results are contradictory to those of O’Neil (115). 
However, O’Neil’s subjects were all females, while those of Tennyson & Wooley 
included both males and females. Unfortunately, analysis of sex differences was not 
reported in the Tennyson & Wooley study, but it seems likely that this is an 
additional complicating factor. 

These studies indicate that predictions about the effect of anxiety on performance 
are not really very simple. High anxiety is not necessarily detrimental to learning. 
Its effects depend not only upon task difficulty but upon sequence of tasks. 
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Helping the Anxious Student 


In attempting to reduce the possibly deleterious effects of anxiety upon learning, two 
approaches have been used. One attempts to reduce the anxiety; the other accepts 
the anxiety and attempts to help the learner by providing aids for the specific 
damaging processes produced by anxiety. 

An example of both the first and second approaches is that used by Sarason (141) 
in studies in which reassuring instructions preceded the learning task. In one such 
study “reassurance” (anxiety reducing), “task orientation,” and “motivating task 
orientation” instructions all prevented the detriment to the performance of high test 
anxious subjects found under an achievement orientation condition, and there was 
no difference between the three positive conditions. In fact, under these conditions 
the performance of the high anxious subjects was superior to that of low test anxious 
subjects. 

Wine (166) observed classroom behaviors of high and low test (trait) anxious 
children in an art class just before an important examination. In this situation high 
anxious children sought teacher approval, listened attentively to the teacher, and 
spent more time sitting idle. Wine summarizes her results by suggesting that the 
high test anxious children under stress become more alert to evaluative cues, less 
alert to task cues, and more concerned about self worth, while low test anxious 
students focus more fully on the task. Wine’s theory, developed more fully in her 
review of previous research (165), helps account for the positive effects of Sarason’s 
instructions, since his instructions deal with the areas in which anxious students 
should have difficulty. “Task orientation” and “motivating task orientation” in- 
structions should help the test anxious student focus his attention on the task, thus 
diminishing the normal deficiency in this process. 

Many of the Florida State studies also use the second approach of compensating 
for deficiences in information processing produced by high anxiety in programmed 
and computer-assisted-instruction (CAI). Since CAI offers a relatively well-con- 
trolled instructional situation, the Florida State group has been able to study quite 
carefully the particular functions which anxiety affects. For example, Leherissey et 
al (86) followed by Sieber’s (144) argument that anxiety’s disruptive effects may be 
through its effect on memory. To assist the anxious student, a list of previous errors 
on the material being studied was given as a memory support. The results of the 
experiment were complex. Unexpectedly memory support tended to increase state 
anxiety; presumably being reminded of errors was anxiety arousing. This compli- 
cates interpretation of results since memory support produced no overall improve- 
ment in performance. But students high in state anxiety did better in the memory 
support group than the high state anxious students without memory support. The 
study by Meyers & Dunham (104) also supports the theory that anxiety has a 
particular effect on memory. They found that while memory span and performance 
on a concept learning transfer task were positively correlated for low trait anxiety 
subjects, the relationship washed out for high anxiety subjects. 

One of the basic questions relevant to practical application of knowledge of 
attribute-treatment interactions is who should determine which students get which 
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treatments. Even if the interaction were clearer than it usually is, the task of fitting 
instruction to the varied students in a typical class may well be too difficult for 
teachers to perform. Thus it is attractive to hypothesize that students themselves can 
select the instruction optimal for themselves. O’Neil (114) provided support for this 
hypothesis by providing memory support at the option of anxious students. This 
condition was superior to one in which mandatory memory support was provided. 

In the studies cited above the major concern has been to reduce the detrimental 
effects of high anxiety; thus the search for interactions between conditions and 
anxiety have focused on the high-anxious group. Tallmadge, Kasten & Shearer 
(154), however, in a complex study of interactions between inductive and deductive 
teaching styles, understanding and rote learning, two subject matters, and 39 learner 
characteristics, found only one interesting interaction. Low anxiety students 
achieved more with deductive rather than inductive instruction. This result is unex- 
plained. 


Aptitude 


The attribute-treatment interaction of longest interest to psychologists has been the 
aptitude-treatment interaction—an area recently reviewed by Cronbach & Snow 
(31). One new study opens the possibility of progress in this area by differentiating 
differing kinds of aptitudes. Koran, Snow & McDonald (82) studied the interaction 
of student aptitude with three methods of training teacher trainees in “analytic 
questioning”: (a) videotape portrayal of the skill; (6) written instructions; and (c) 
no special instruction. Both experimental treatments produced greater effects on 
behavior in microteaching situations than the control. Videotape presentation was 
the best of the three, particularly for students who had made high scores on the film 
memory test, but the written presentation was more effective for subjects who had 
scored well on the hidden figures test, a result surprising to the researchers. They 
suggest the possibility that a given treatment, in this case videotape, may either work 
well for those whose abilities fit that treatment, such as students high in “film 
memory,” or it may serve a compensatory function, as for the low “hidden figures” 
students who may lack ability to visualize. 

At the moment there seems to be a quiescence in the area of research on cognitive 
variables interacting with treatments. The Koran, Snow & McDonald study as well 
as the studies on anxiety suggest that more careful analysis of individual and 
treatment characteristics may pay off. 


Achievement Ortentation 


Achievement motivation and various achievement orientations have been obvious 
candidates for investigation by those interested in instruction and education. Among 
the more widely used scales are Gough’s Achievement via Conformance and 
Achievement via Independence Scales from the California Psychological Inventory. 
One of the few educational studies in which clear interaction effects were obtained 
was Domino’s study (34) of interaction of these two characteristics with teaching 
style. Those students taught in a manner consistent with their achievement orienta- 
tion, as indicated by the title of the scale, obtained significantly higher means (than 
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students taught in a discordant manner) on multiple choice tests, essay questions 
involving factual knowledge, and on the students’ ratings of the teacher effectiveness 
and value of the course. A somewhat similar result was obtained by Goldberg (57) 
using a criterion of amount of extra nonrequired reading completed. Thus a glimmer 
of hope for attribute-treatment interactions appears. 


Socioeconomic Status and Early Education 


The “best-seller” of the year in the area of instructional psychology was one by 
Jencks (70). His book has been so extensively reviewed and criticized that there is 
probably little point in an additional review here. However, it is worth pointing out 
that Jencks did not conclude from his review of the rather scanty research available 
that schools make no difference. Rather he concludes that not going to elementary 
school makes a marked difference for lower class children and that there are 
significant differences between elementary schools in effectiveness. 

Despite the heat generated about the effects of early instruction, relatively little 
additional evidence seems to have emerged in the past 2 years. Jacobson et al (68) 
found that preschool children from poverty backgrounds could be systematically 
trained in concept learning in ways that increased scores on the Stanford-Binet as 
well as on tests of concept learning. Jacobson & Greenson (69) reported that half 
the IQ gains were maintained 14 months after the training ended. Unfortunately the 
control group was not retested. 

What does a gain in IQ mean in studies such as this? One answer is that an 
intelligence test is a pretty good measure of generalized achievement and that gains 
in IQ thus indicate that the training has produced cognitive gains outside the specific 
tasks upon which the training was done. Another interpretation of IQ gains ts that 
they largely represent changes in motivational and social factors affecting test 
taking, as suggested by the study of Zigler & Butterfield (171). But most readers of 
studies in which IQ gains are produced are probably reading more into the intelli- 
gence test scores. Intelligence tests are primarily used as predictive devices. The 
implication of a gain in IQ is that the individual who has made the gain has 
developed skills or aptitudes which will be predictive of later academic achievement. 
But when IQ scores are changing as a result of a training program, are the scores 
after the program better predictors than those obtained at the beginning of the 
program? The question is not whether IQ gains? persist but whether or not school 
and other intellectual performance is thereafter at a higher level. 

To say that the gains in IQ should not be expected to persist without changes in 
the schools is a weak rejoinder. Intelligence tests were designed to predict achieve- 
ment in conventional schools. If changes in intelligence (academic aptitude) have 
really been produced, it is in conventional schools that the effects should be most 


?While many child psychologists have stressed the effects of early training on curiosity or 
achievement motivation, another effect may be the sheer pleasure of familiarity with words 
and schools, Zajonc has shown in several studies (170) that simple exposure leads to liking. 
A concept training task thus not only influences cognitive mechanisms involving concept 
formation but also increases liking for the concepts. 
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apparent. Rohwer (130) deals with this point when he shows that lower socioeco- 
nomic status black children are not as disadvantaged in certain kinds of learning 
not requiring conceptual elaboration as in IQ. Learning skills, such as conceptual 
elaboration, can be taught (Rohwer & Ammon 131) so that lower socioeconomic 
status children can learn more effectively in school. But in all of these efforts good 
longitudinal data are still needed. 


Attribute-Treatment Interactions 


The studies in this section have been based largely on the faith that instruction could 
be improved if it were adapted in terms of differences in student characteristics. The 
results of the research reported provide some empirical support of that faith. But 
the largest recent study of trait-treatment interactions produced relatively limited 
results. Goldberg (57) compared lecture and independent study methods and 
quizzes vs papers in two psychology courses. Rather than choosing personality 
variables on the basis of a theory about the personality characteristics of individuals 
likely to be differently affected by the two methods, he tested 350 different personal- 
ity measures for interaction effects with the teaching methods. Not surprisingly only 
about one-third more significant interactions were found than would have been 
expected by chance. 

Despite Goldberg’s rather pessimistic view of attribute-treatment interactions, 
few citizens, few educators, and not even many researchers would reject the hope 
that educational environments can somehow be varied in ways that will permit 
adaptation to characteristics of the developing learner. A measure like an intelli- 
gence test designed to predict success with all sorts of schools and teachers is not 
likely to be effective in discriminating particular interactions, but other measures, 
dealing with changing and less general characteristics, may be more useful in 
interaction analyses. Glaser (55) has compared adaptive and selective (survival of 
the fittest) modes of education and concluded that our penchant for a fixed educa- 
tional mode derives from the tendency to think of students as having fixed enduring 
traits. What we need, he suggests, are measures of new aptitudes—aptitudes that 
are trainable, aptitudes conceptualized in terms of processes. 


THE TEACHER 


Some aspects of the teacher’s role in instruction are, of course, implied in the studies 
of Attribute-Treatment Interaction discussed above. In the studies reported in the 
previous section, however, our focus was upon learner characteristics, and the 
teacher was essentially the context. In this section the teacher moves to the fore- 
ground, and we look at ways in which teachers differ and how these differences affect 
learning. 


Dimensions of Ratings of Teaching Effectiveness 


The use of student ratings of teaching effectiveness is a topic which has generated 
a great deal of heat and research activity, particularly at the college level. Not all 
of the controversy is directly relevant to the topic of this chapter, but the question 
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of dimensions of teacher characteristics related to student learning seems directly 
applicable to instruction. Costin, Greenough & Menges (29) have reviewed the 
research up to 1971. Recent studies have stirred up additional interest. 

Feather (38) reported the results of factor analyses of a 22-item scale derived from 
the University of Washington scale. Three factor analyses were performed. The 
three samples were five first-year, five second-year, and seven third-year psychology 
courses at the Flinders University of South Australia. 
1.The first factor, Instructional Competence, included items such as: 

“Is clear and understandable in his explanations”; 

“Instructor’s general teaching effectiveness in comparison to other psychology 

teachers”; 

“Interprets abstract ideas and theories clearly.” 

2. The second factor, Interest, included: 

“Subject matter of the course is interesting”; 

“Value of the course in comparison to other courses you have taken”; 

“Has helped broaden my interests.” 

3. The third factor, Instructor Attitude to Students, included: 

"Has sympathetic attitude toward his students”; 

“Treats his class as adults”; 

“Is reasonable in his demands on student time.” 

These factors’ seem comparable to those of previous factor analytic studies. 
Factor 1 appears to be comparable to one found by Field et al (39) and by Frey (46). 
Factor 2 also is comparable to one found by Frey. Factor 3 appears to be much the 
same as that found in Germany by Miiller-Wolf & Fittkau (109) and in Australia 
by Field et al’s multiple discriminant analysis. The third function found by Field 
et al is that labeled by Isaacson et al (67) as “Feedback.” 

Frey (46), whose study will be reviewed later, found six factors: 

1. Student accomplishment 

2. Organization planning 

3. Work load 

4. Grading 

5. Teacher’s presentations (comparable to Feather’s “Instructional Competence”) 

6. Teacher accessibility (comparable to Feather’s “Instructor Attitude to Stu- 

dents”) 

Frey’s “Student Accomplishment” probably picks up much the same variance as 
Feather’s “Interest” factor with its item “Value of the course ...” and enters the 
area tapped more extensively by Hogan & Hartley’s (63) items on student self- 
development. Hogan & Hartley’s student self-development items yield three factors: 
General Cognitive Development, Field-Specific Development, and Relevance. 


Factor analytic results are limited by one methodological problem. Since within-class and 
between-teacher covariance are lumped together in these studies, factor analysis primarily tells 
us the dimensions used by students as they perceive a teacher, not necessarily the dimensions 
along which teachers differ. Multiple discriminant analysis seems less vulnerable in this respect 
(see Field et al 39). 
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An interesting feature of these results is that student ratings of teaching effective- 
ness are differentiated from those of the value of the course, with course value more 
closely tied to the important educational goal of developing broader interests. While 
“teaching effectiveness” ratings are not, as some critics have suggested, measures 
of cheap popularity, it is probably worthwhile to get a separate rating of course 
value, which seems to give greater weight to the total plan and organization of the 
course and less to classroom performance. 

The point of these studies is presumably to determine how teachers and classes 
differ, and in terms of student perceptions the answer apparently is that they differ 
along eight or ten dimensions. Now we turn to the crux of the matter: “Do these 
differences among teachers perceived by students make a difference in instructional 
effectiveness?” 

Probably the most publicized recent study in this area was that of Rodin & Rodin 
(129), which attracted attention both because its results were so surprising and 
because they supported the doubts many faculty members already had about student 
ratings. Rodin & Rodin reported a high negative correlation between mean student 
ratings of teaching assistants in a calculus course and the mean course grade of the 
students. Frey (46) repeated the study with a more carefully controlled research 
design and analysis and obtained results in the opposite direction; i.e. the teachers 
whose students achieved the most were rated as most effective. 

All six of the factors on the Frey scale showed positive correlations with mean 
student performance. Even though the N’s of teachers were small (8 in introductory 
calculus, 5 in multidimensional calculus) three of the factors produced statistically 
significant correlations. These three factors were 
1. Student accomplishment: 

“This course has enabled me to identify and analyze central issues in this field”; 

“I can now understand relatively advanced presentations on this subject”; 

“This course has developed my ability to examine the evidence in this field.” 
2. Teacher’s presentation: 

“The teacher made good use of examples and illustrations”; 

“The teacher discussed the course material in an insightful and penetrating 

fashion.” 
3. Organization-planning: 

“The teacher arranged the class schedule in an orderly way and followed it 

closely.” 

Frey’s results indicate that students can report a teacher’s effect on learning when 
they are asked to do so specifically. Note that the “Student Accomplishment” items 
do not ask the student to rate teacher effectiveness but rather to report effects of the 
course upon the student himself. This fits with the findings of McKeachie, Lin & 
Mann (99), who found less striking, but supportive, evidence for the validity of 
student ratings. 

While student ratings can tell us something about how instructors differ and may 
also indicate which teachers are most effective for particular goals, to date they have 
not revealed any instructional characteristics which seem to be universally related 
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to instructional effectiveness. Thus we find here, as in the preceding section on the 
student, a hint that simple, one-variable, linear relationships are not jumping out 
of the research data. Interactions between teacher characteristics, student character- 
istics, and outcomes are also not “bustin’ out all over,” but some tender buds are 
beginning to suggest possible fruitfulness. 


Teacher Expectancy 


Feather’s third factor, Instructor attitude toward Students, represents a dimension 
found in many studies having to do with the teacher’s relations with, and expectan- 
cies of, students. The most famous study of teacher expectancy was Rosenthal’s 
Pygmalion in the Classroom (134), which continues to influence educators 5 years 
after publication. Recent studies agree with Claiborn (23) in failing to replicate the 
Pygmalion effect. Fleming & Anttonen (42) undertook a large scale replication with 
better controls and found that false information given the teacher about pupil IQs 
had no effect on later IQ measures. The study suggested that teachers tended to 
distrust information discrepant with their own assessment of student ability—a 
result according with that of José & Cody (74). Thus once again an exciting finding 
bites the dust. My Fair Lady in the classroom was really fairer than Rosenthal 
perceived. 

Pellegrini & Hicks (123) also failed to find a significant effect of expectancy in a 
tutorial situation except when tutors were shown the test that would be used as a 
dependent variable. Rubovits & Maehr (139), however, were able to produce differ- 
ences in teacher behavior in a situation in which college students were brought 
together with strange children for a one-hour teaching session. Students labeled as 
gifted were more likely to be asked questions and praised for their statements than 
were those labeled nongifted. This finding is not very convincing evidence of the 
ability of inaccurate information to influence behavior of teachers who are interact- 
ing with students over a semester. 

Does this mean that teacher expectancies are unimportant? Probably not. The 
Fleming & Anttonen and José & Cody studies did not really disprove the effect of 
teacher expectancies on student learning, but rather indicated that false information 
did not produce a significant effect on teacher expectancy [as pointed out by Finn 
(40) in a thoughtful review of the expectancy literature]. Rothbart, Dalfin & Barrett 
(135) did find that teachers paid more attention to students designated as “better,” 
and Finn demonstrated that in some settings teachers do hold expectations which 
bias their evaluations of pupil work. What the replications suggest is that the 
connection between teacher expectancies and changes in student academic ability 
is a good deal more complex and uncertain than the publicizers of Pygmalion in the 
Classroom led the public to believe. While it seems unlikely that Rosenthal’s results 
can be fully replicated, it does seem that when the teacher’s expectancies differ there 
may be ensuing differences in teacher behavior toward students. Strongly held 
stereotypes may indeed resist change from pupil-teacher interaction and make 
differences in teacher behavior which may in turn influence student achievement. 
Thus far, however, research has not been able to demonstrate these links. 
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TEACHING METHODS, TECHNOLOGY, AND 
CHARACTERISTICS OF THE CLASS 


After our look at the student and the teacher, the two most important persons ın 
instruction, we now turn to the classroom and the methods of instruction used in 
educational institutions. Lurking in the background of our reflection about classes 
and teaching methods should be the recognition of the context provided by the 
school. While we will cite relatively little research on characteristics of schools 
influencing instruction, there can be little doubt that the nature of schools in our 
society strongly conditions the effects of instructional methods, a point developed 
in theoretical detail by Getzels (53). Moreover, the very structure of the classroom 
suggests certain goals and patterns of interaction. The open classroom, the free 
school, the student-led class, —these are still deviant activities which carry a special 
burden of risk and a resulting aura of anxiety and excitement. And as Cohen (26) 
has pointed out, conventional classes differ in the degree to which the teacher’s goal 
is obedience rather than learning. 

In this section we deal with major instructional methods and characteristics of 
the class such as the Keller plan, programmed learning, computer-assisted instruc- 
tion, simulation, class size, team teaching, homogeneous grouping, student options, 
and student-centered teaching. The next-to-last topic of the section is labeled Stu- 
dent Activity and should probably be with Questioning in the later section on 
Content, but it seems to flow from student-centered teaching, so here it is. 


The Keller Plan 


The brightest new star in the constellations of college teaching methods has been 
the Keller Plan (76), named after one of its major creators, but also variously called 
PSI (Personalized System of Instruction), “individualized instruction,” “contingen- 
cy-management instruction,” or “modular instruction.” (The system is not very 
personalized or individualized except in comparison with the large lecture, which 
is the favorite bete noir to which all new teaching methods wish to be compared.) 

Although the first controlled studies of the Keller plan showed no superiority in 
achievement and less effective use of time as compared with conventional teaching, 
there are now empirical data indicating that the Keller plan can sometimes be 
superior to lectures in affecting student achievement and retention and is usually 
superior in student satisfaction (Born, Gledhill & Davis 17, Cooper & Greiner 27, 
McMichael & Corey 101, Rubinstein & Friedman 138, and Witters & Kent 167). 

Attention is now shifting to research on modifications of the plan to further 
enhance learning. Sullivan & Hartley (152), for example, gave bonus points for 
reaching a given criterion on weekly tests, rather than requiring a certain criterion 
before a student could move to the next unit. The results indicated that bonus points 
had little effect on final examination scores. Knight (80) studied the effect of three 
motivational conditions. The control condition involved taking weekly 10-item tests. 
Students in the two experimental conditions were allowed to retake the test a second 
or third time, but if 100% correct answers were not attained on any one of the 
weekly tests by the third retake, students in one experimental group were punished 


172 McKEACHIE 


by dropping one full grade in the course (e.g. from A to B), and students in the other 
group were given a grade of F (failure) for the course. The latter two groups 
performed better (on the first administrations of the tests), and there was no effect 
on performance in other courses. However, pacing students by giving quizzes made 
no difference in achievement in Lasco’s study (85). Thus, of three attempts to 
increase motivation, the only successful one was a very strong threat. 

What kind of students benefit from the Keller plan? Born, Gledhill & Davis (17) 
found that although the poorer students tended to drop out, those who stayed in 
the course tended to show the most gain from the Keller plan. Ability differences 
affecting gains were not found, however, by Hellström (62). Dogmatism did. not 
interact with method in the study by Rubenstein & Friedman (138). Thus there is 
at present little indication of differential effectiveness of the Keller plan for differing 
types of students. 

What kind of learning does the Keller plan help? One might expect the Keller 
plan to be most effective with simple memorization, but in fact Born et al (17), 
Morris & Kimbrell (108), and Rubinstein & Friedman (138) found it to be more 
effective than lecture for higher level cognitive objectives as well, and Stalling (148) 
found a favorable effect upon motivation for further learning as measured by inten- 
tion to major in psychology. Kelley (77) found similarly wide-ranging favorable 
effects on achievement, test performance, and motivation to major in economics for 
his TIPS (Teaching Information Processing System), a similar method. As with 
most other methods of instruction, it thus appears that the Keller plan can be 
effective for a variety of goals if it is well planned and well administered. The Keller 
plan, well done, requires a great deal of planning and effort. To achieve higher level 
outcomes requires additional effort on the part of teachers and tutors. When the 
Keller plan fails, it is because of lack of such investment rather than inherent 
deficiencies in the method itself (which 1s propably true of discussions, lectures, and 
other methods as well). 


Programmed Learning 


The Keller plan is an attempt to gain some of the benefits of programmed learning 
while transcending its limitations. But traditional programmed learning continues 
to be a frequently used instructional tool. Costin (28), in an excellent review of 
lectures vs other methods of teaching, concludes that programmed study may have 
an advantage over lectures in promoting acquisition of information. Beard & Bligh’s 
comprehensive review of research on college teaching (12) points to a similar conclu- 
sion. 


Computer-Assisted Instruction (CAI) 


Computer-assisted instruction in reading has been under development for over a 
decade. Fletcher & Atkinson (43) report the effects of adding 8 to 10 minutes a day 
of CAI to the normal instruction of 50 first graders for 5 1/2 months. The CAI 
produced substantially better performance both on tests tailored to the CAI objec- 
tives and on standardized achievement tests. Greater benefit for boys than for girls 
was found. Allen (2), however, found no significant differences between CAI and 
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lecture in an introductory psychology course. Students who felt that lecture classes 
spend too much time on each topic and who liked to solve problems and take tests 
did better with CAI than with lecture. 

Vinsonhaler & Bass (161) reviewed ten major reasonably well designed studies of 
the effectiveness of CAI drill and practice on learning in language and mathematics. 
In general the results indicate that 5 to 15 minutes of CAI added to regular 
instruction each day produce performance gains. Unfortunately only two studies 
used a control group with a comparable supplementation. In one (Suppes & Morn- 
ingstar 153) classroom drill was as effective as CAI. In the other (Dick & Latta 33) 
programmed instruction was superior to CAI. Thus it appears that CAI has no 
special magic that will solve our instructional problems. It can relieve teachers of 
some time spent on drill (as can printed programmed instruction). Whether CAI 
is better in this respect than arranging for pupils to tutor one another is not known. 
Intuitively one feels that the computer should have great instructional potential, but 
apparently it is not going to be realized without a great deal of arduous work. 

One promising direction for this research is the approach used by Pask and 
associates (Pask & Scott 121, 122), who carry R. C. Atkinson’s (5) encouragement 
of response sensitive paradigms to the point of a colloquy between the computer and 
the learner to determine the learner’s preferred learning strategy. Pask demonstrates 
that matching the structure of the learning program to the learning strategy (holist 
or serialist) of the student results in more effective instruction than learner control 
or an incompatible program. In general the research on learner control of CAI is 
inconclusive, leading Judd (75) to conclude his review with the suggestion that 
students may need specific training to exercise effective control of their own instruc- 
tion. 

An intriguing aspect of the effort of Pask and his colleagues is that their proce- 
dures are useful not only in teaching statistics but in helping the learner discover 
his own most effective learning strategy to be used both in CAI and non-CAI 
situations. As Zinn has suggested in oral presentations,* the most effective use of 
computers may not be in programming every detail of an instructional sequence but 
rather in using the computer in ways which will help learners make their own 
judgments and decisions more effectively. This does not necessarily imply matching 
instruction to a student’s present cognitive style. It may mean finding where he is 
and helping him move to a more effective strategy or to more flexible uses of 
strategies appropriate for differing kinds of learning. 


Simulation and Games 


One of the more glamorous teaching methods is simulation. Used extensively during 
World War II in training pilots, naval command officers, and other personnel 
required to perform complex tasks, simulation is now used in a variety of instruc- 


*Karl L. Zinn, University of Michigan: A Technology for Instruction in the Age of Infor- 
matics, presented at Lehr Systeme 72, Berlin, April 1972; and Computer Learning Under 
Evaluation (Project CLUE) and an Attempt at Hyperspeech, presented at RAND Corp. 
conference on Computers in Higher Education, October 1970. 
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tional situations. For example, teacher training programs often include experience 
in simulating instruction, frequently with videotape feedback. “Microteaching,” a 
brief simulation involving a specific instructional skill, has been widely publicized 
as a potentially valuable tool for instructing prospective teachers. Jensen & Young 
(72) studied the effect on teaching behavior of a teacher training program. Both 
experimental and control groups had the same training experiences except for three 
7-minute microteaching experiences given the experimental group. Teaching behav- 
ior during the following term of practice teaching was evaluated by raters unaware 
of the experimental variable. The microteaching group proved to be superior on five 
of the six factors of Sinha’s Teacher Performance Evaluation Scale. Thus mi- 
croteaching can be an effective training device. Unfortunately, research on mi- 
croteaching has not produced evidence of its effectiveness in terms of ultimate 
impact on learning of the trainee’s students. Despite the increasing popularity of 
simulation and educational games, little evidence on their instructional effectiveness 
has emerged. 


Class Size 


Class size is probably the hardiest perennial of instructional research. Following an 
initial spate of studies in the 1920s, interest shifted to teaching methods, and the 
popular generalization was that class size was not the important variable but rather 
what the teacher did in classes of different sizes. This reasonable conclusion damp- 
ened enthusiasm for studies of class size, but now after several decades of attention 
to teaching methods, class size again is beginning to emerge as an interesting variable 
which affects student learning, partly because it does have a potent effect on the 
teacher’s choice and flexibility in use of teaching methods. McK eachie (98) reviewed 
research on class size at the college level and pointed out the consistency of direction 
of differences favoring small classes for higher level cognitive achievement and 
motivational and attitudinal outcomes. Moody, Bausell & Jenkins (107) carried out 
a parametric study of the effect of class size on the learning of mathematics at the 
fourth grade level. Using class sizes of 1, 2, 6, and 23, they found the smaller class 
sizes to produce greater learning than the larger classes, with the tutorial (1 to 1) 
class most effective. Similarly Bausell, Moody & Walz (11) found individual tutor- 
ing to be more effective than classroom instruction. Thus the evidence supporting 
small classes continues to accumulate. 


Team Teaching 


Fifteen years of trial of team teaching in elementary schools has thus far produced 
no evidence of its instructional value. Rhodes (127) found in a well-controlled study 
that team teaching in grades K-6 was not only not superior to traditional, self- 
contained classes but was actually worse in affecting improvement in reading skill 
and pupil attitudes. 


Homogeneous Grouping 


The current social climate has placed the practice of homogeneous grouping, track- 
ing, Or streaming under severe pressure, largely in terms of its possible use to 
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preserve segregation by race and social class. Homogeneous grouping may simply 
be based on the assumption that bright students can learn faster than students with 
less ability, but often there is an implicit assumption that the aptitude-treatment 
interactions (discussed in our section on the student) are so important that students 
of different ability levels need to be grouped so that they can be taught differently. 
This assumption finds less support in the ATI literature than in the studies of 
validity of student ratings of instruction, where there has been evidence that teachers 
rated as being more effective by bright students than by poorer students are more 
effective with bright students and vice versa. Research on the effects of homogeneous 
grouping on achievement was reviewed by Barker-Lunn (9), with the conclusion 
that most of the differences favored homogeneous grouping.® Clifford (24) studied 
the effect upon vocabulary list learning of competition in homogeneous fifth grade 
classes. The competitive treatments increased interest but not achievement or reten- 
tion. Clifford presents a model emphasizing the importance of intrinsic motives 
rather than extrinsic reward. 


SEX DIFFERENCES IN EFFECTS OF HOMOGENEOUS GROUPING BY ABILITY Levy & 
Tucker (89) present data suggesting that homogeneous grouping may be particularly 
effective for bright boys, while heterogeneous grouping was more effective for the 
less intelligent boys. For girls the results were reversed. House (65) has presented 
evidence that males are oriented toward testing out their abilities in social achieve- 
ment situations, while females are more likely to avoid social competition. The 
tendency to seek competition was positively correlated with positive self-attributes 
for males but negatively correlated for females. If positive attributes correlate with 
intelligence as assessed in the Levy & Tucker study, and if homogeneous grouping 
increases competitive pressures, the Levy & Tucker study supports House’s theory. 
Gjesme (54), working from the assumption that girls have subjective probabilities 
of success lower than objective reality, suggested that in a heterogeneous classroom 
high ability girls would have their achievement-related motives more strongly 
aroused since they would then subjectively perceive a moderate level of success, 
which according to J. W. Atkinson’s theory (4) should be maximally motivating. 
Gjesme’s results support this theory, as do those of Levy & Tucker. 


Student Options and the Contract System 


Somewhat more personalized than the “Personalized System of Instruction” is the 
increasing practice of offering students choices of various options for achieving 
course objectives and perhaps even some options with respect to objectives. As we 
saw in our review of learner control of computer-assisted instruction, neither psy- 
chologists nor learners are very good in choosing the most effective instruction. 
Pascal (120) studied the effects of three instructional! options offered to students: 
lecture, lecture with discussion, and independent study. Although students given 

5A comprehensive review by Findley & Bryan reaches negative conclusions about homoge- 


neous grouping. Findley, W. G., Bryan, M. W. 1971. Ability grouping: 1970. Athens, Ga.: 
Center for Educational Improvement. 
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their preference developed more positive attitudes toward psychology, there was no 
effect on cognitive outcomes. The three methods did have different outcomes inde- 
pendent of the option aspect. The lecture group was superior on a test of knowledge, 
while the independent study group was superior in evaluation of a previously unread 
psychology article. 

Another sort of option is the contract system of teaching in which the student 
negotiates a contract with the teacher. The contract normally specifies learning 
activities which the student will undertake and the method of evaluation and grad- 
ing to be used. Poppen & Thompson (124) did not find significant differences in 
achievement on examinations or case study reports between students taking an 
educational psychology course under the contract method and a control group 
taught by undescribed traditional methods. 

A different sort of option was provided in Fry’s (49) study of student control of 
instruction. In this experiment students were given a deck of cards containing a list 
of questions about computers. For each question there was a videotape answer. The 
student could ask the questions in any order and could ask a tutor further questions 
after the videotape. Students in this condition were compared with students who 
received the questions and answers and videotapes of a yoked experimental subject’s 
extra questions and answers in a sequence determined by experts. A third group 
received the materials in random order. Results showed that the subject-controlled 
group learned significantly /ess than the other two groups, although subject control 
worked well for high aptitude students who were also high in a measure of inquisi- 
tiveness. High aptitude subjects low in inquisitiveness did best with the sequence 
determined by experts. One suspects that one of the factors sometimes neglected in 
sequencing research is the motivational effect of surprisingness. Randomness may 
not make sense conceptually, but it may be more fun than a sequence so logical and 
with steps so small that one knows exactly what will be coming next. Such factors 
can be taken into account in planning a sequence of instruction, but as Atkinson 
(6) has suggested, the strategy likely to be most effective in ordering instructional 
materials is likely to be “response sensitive”; i.e. one using the subject’s own re- 
sponses from moment to moment to determine what should be done next. Such a 
program will be discussed later under the heading Vocabulary List Learning. 


Student-Centered Teaching 


Student-centered teaching was one of the most frequently studied teaching methods 
during the post-World War II period and showed some revival of interest as T- 
groups flourished during the 1960s. Lloyd (91) reviewed research on student-cen- 
tered vs teacher-centered teaching at the college level and found consistency in 
results pointing to the superiority of student-centered teaching for higher level 
cognitive goals and for motivational and attitudinal changes. Three more recent 
diverse studies support this conclusion. Kohut (81) set up an experimental 6-week 
senior high school course in Colonial American History and found that a pupil- 
centric method was superior to teacher-centric on an essay examination (but not an 
objective test), and this superiority persisted in a test of retention. Dawson, Messe 


INSTRUCTIONAL PSYCHOLOGY 177 


& Phillips (32) manipulated two aspects of student-centered teaching—consider- 
ation and initiating structure—in a general psychology course. High consideration 
produced better performance than low consideration on all three criteria used— 
examination scores, bibliography, and research. Haddon & Lytton (59) followed up 
students who had been tested 4 years earlier just before completion of primary 
school in a study of the effects of “formal” and “informal” schools on divergent 
thinking. The original results had favored the “informal”® schools. On the follow- 
up 4 years later the differences persisted. 


STUDENT ACTIVITY AND PEER INSTRUCTION One of the desired features of student- 
centered instruction is a high level of student activity. In addition to the studies cited 
above, there are other indications that increased student activity and interaction 
may improve instruction. Milbers (105), for example, found that participation in 
conversation groups involving active interaction produced better comprehension, 
fluency, and vocabulary in French than attending an audio laboratory. Oliver (112) 
found that demonstrations designed to stimulate discussion in psychology classes 
influenced student attitudes favorably, including motivation for majoring in psy- 
chology. Centra & Rock (22) found that colleges characterized by high degrees of 
faculty-student interaction were particularly effective as measured by student per- 
formance on Graduate Record Examination area tests. Cobb (25) found that chil- 
dren who talk about academic material to their peers are more likely to succeed 
academically than children who pay attention without interacting with peers. Talk- 
ing with teachers may also help. Wood & Wilson (168) found that college students 
whose intellectual orientation increased during college reported more out-of-class 
interaction with faculty than did other students. Such students also reported more 
gains in specific academic skills. Thus interaction between students and peers and 
between students and faculty produces favorable educational outcomes at several 
levels. 

Habits of interacting with teachers may be developed through modeling and 
reinforcement. Zimmerman & Pike (174) found that disadvantaged second-graders 
learned to ask more questions when they observed a model asking questions and 
were praised for the questions they themselves asked. Praise alone was not as 
effective. Some generalization to a new teacher occurred. 


INSTRUCTION BY STUDENTS One method of stimulating student activity is through 
students teaching other students. In an earlier review of instructional psychology, 
Gagné & Rohwer (51) express some skepticism about the value of Jearning by 
teaching. Long (92), however, presents data indicating that serving as an experi- 
menter in a paired-associate learning experiment has a beneficial effect on later 
learning of paired associates. Comparable time as experimenter with a nonrespond- 
ing subject or as an observer did not produce the beneficial effect. 

The Free School movement has thus far produced more rhetoric than evidence. The August 


1972 Harvard Educational Review is devoted entirely to “Alternative Schools” and has several 
interesting and provocative essays. 
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In a college English class, Romig (132) found that student-led discussions pro- 
duced higher achievement by the students in the group on higher level cognitive 
outcomes (but not on knowledge), and Hamblin & Hamblin (60), working at the 
other end of the educational scale, found that peer tutoring was more effective than 
adult tutoring in teaching disadvantaged preschoolers to read. These results fit with 
earlier research summarized by McKeachie (98), indicating that student-led classes 
are effective—often more effective than teacher-led classes. 


OBJECTIVES, CONTENT TESTING, AND FEEDBACK 


The line between this and the preceding section is not always very clear in my own 
mind. Why questioning and testing are here rather than with the Keller plan and 
CAI is probably an arbitrary decision. To some extent the latter part of this section 
emphasizes what I call microaspects of instruction—variables, such as feed- 
back, affecting learning in a single class period regardless of the general teaching 
method. 


Objectives 


Probably the single strongest pressure exerted by educational psychologists upon 
teachers is the importance of having specific behavioral objectives. While such an 
exercise may be good for the soul, it appears from the study of Jenkins & Deno (71) 
that it may not be particularly helpful for instruction. In a study in which (a) 
teachers, (6) students, or (c) both teachers and students were given either general 
or specific objectives of an instructional unit, no significant differences in student 
achievement were found. Similarly Olson (113) found that giving students specific 
behavioral objectives vs general understanding objectives did not make a difference 
in learning from an instructional program in interior design. Geis’ sensible review 
(52) similarly finds no significant effects of behavioral objectives. Note that these 
results do not tell us that the process of working out objectives for oneself is use- 
less; perhaps the experimenters were the ones who benefitted in the studies 
above. 

Rothkopf & Kaplan (137), however, in a briefer instructional experiment found 
that more intentional learning resulted from specific than from broad objectives 
given learners. Incidental learning was not affected. Both incidental and intentional 
learning were favorably affected by explicit description of instructional goals as 
compared with directions simply to learn everything. 


CURRICULUM A major national evaluation of a new physics course was reported 
by Welch & Walberg (164). In a well-designed controlled experiment, the Project 
Physics course proved not to be more effective than traditional courses in achieving 
the cognitive goals for which it was designed, but affective and attitudinal outcomes 
were favorable to the Project Physics course. An arithmetic program, Project 
SEED, was also evaluated in a large scale project and proved to be superior to 
conventional programs on standardized achievement tests (Dysinger & Rosenfeld 
35). 
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STRUCTURE OF CONTENT AND INFORMATION PROCESSING 
STRATEGIES 


Interest in cognitive structures and information processing strategies has generated 
research studies dealing with individual differences in the strategies used by learners 
and studies dealing with strategies of presenting content. 


Audiovisual Elaboration 


Mallory (93) showed that second-grade pupils were helped more than kindergarten- 
ers by presentation of noun pairs to be memorized in either auditory or visual 
elaborative contexts linking the two nouns. Children could be classified in terms of 
their use of the auditory or visual information processing strategy and this proved 
to be rather stable. 

Increasing the elaborative context of material to be learned may not always be 
helpful. A realistic educational TV presentation, for example, proved to be less 
effective for college level students than a program with relatively plain, less realistic 
materials in the research of Barrington (10). This result may be surprising to those 
not familiar with earlier research on instructional films and television which showed 
that “bare bones” presentations were more effective than those in which color, 
realistic detail, or other features not essential to the learning desired were added 
(Adams, Carpenter & Smith 1, Carpenter & Greenhill 20). Increased meaningful- 
ness aids learning up to a point, but complete realism may add too many irrelevant 
cues if the goal goes much beyond recognition. In summary, a structure of content 
which helps link critical features in some meaningful way aids retention and applica- 
tion; presentations which add complexity that is not relevant reduce effectiveness. 


Classification Learning 


When one is trying to teach a classification system such as biological types or types 
of music or literature, should one present several examples of one type first or should 
examples of the different types be intermixed? Sanders, DiVesta & Gray (140) 
confirmed the advantage of the former method but showed that this method was 
not particularly helpful for subjects who were not systematic in their formulation 
of hypotheses. Friedman & Greitzer (48) showed that presenting attributes before 
names was more effective than names before attributes, suggesting that this is 
conducive to a coding and retrieval strategy organized heirarchically. They point 
out that this advantage was not found in previous research in which there were more 
attributes than names. Similarly Stolurow (149) showed that children learned to 
identify parts of speech better when the sentence and part of speech were not varied 
simultaneously in the training program. Shifting both aspects complicated the task 
too much. All of these studies are reminiscent of the old controversy over part-whole 
learning and suggest a reformulation of part-whole problems in terms of a cognitive 
strategy in which a smaller, simpler set is learned before a larger or more complex 
related set of elements. 
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Vocabulary List Learning 


When one is trying to teach a vocabulary list for a foreign language, such as 
Japanese, how should the lists of words be presented? English-Japanese? Japanese- 
English? Horowitz & Gordon (64), using the standard verbal learning premise that 
this task involves two components—associative learning and response learning— 
demonstrated the transfer value of pretraining on each component, 1.e. training 
associations through presentation of Japanese-English pairs and training responses 
separately (such as by having subjects give the Japanese word when simply given 
some of its letters, no English). Such pretraining was not, however, as efficient as 
simple practice on the criterion English-Japanese pair learning. Intermingling the 
pretrained tasks improved pretraining effectiveness. 

Atkinson & Paulson (8) describe the steps necessary for developing an optimal 
instruction strategy, using as an example the learning of Swahili vocabulary. A 
parameter-dependent strategy taking individual differences into account proved to 
be more efficient than an all-or-none model or a standard procedure not taking 
individual differences into account. Even more dramatic differences were found in 
an experiment by Atkinson (7) using the task of learning German vocabulary. In 
this case a random presentation of items produced correct responses a week later 
of 38%; when subjects had selected their own items for study the percent correct 
was 58%, a performance equivalent to that achieved when items were selected on 
the basis of values of parameters estimated on the basis of a pilot experiment; when 
a separate set of parameters was calculated for each item with the values determined 
by the subject’s past performance on the item the percent correct rose to 79. Thus, 
despite our earlier discussion of the lack of effectiveness of simple programs, it 
appears that when one’s theory of learning is good enough, takes account of individ- 
ual differences, and is adjusted in terms of immediate past experience, planned 
instruction can be superior to a random order or student-selected presentation (at 
least for a single 2-hour lesson). 


Learning by Discovery vs Learning by Rule and Inductive Method vs 
Deductive Method 


Studies cited earlier have looked for interactions of student characteristics with 
inductive and deductive teaching methods affecting learning. It may well be that the 
paucity of clear results is due to the complication of another interaction—method 
by outcome, as exemplified in a complex study of learning the concept of trochaic 
meter in poetry (Tennyson, Wooley & Merrill 156). 

Mayer & Greeno (95) found that in teaching the concept of binomial probability, 
students taught by being given the formula with rules performed better on transfer 
to familiar items, while those taught more conceptually were better able to identify 
unanswerable problems and to answer questions about the formula. Mayer & 
Greeno suggest that the instructional differences produce differences in the degree 
to which the concept is connected to internal mathematical structures (required in 
familiar problems) or to external cognitive structures resulting in general under- 
standing. Shavelson (143) used diagraphs to analyze the content structure of a 
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segment of high school physics and showed that the learner’s cognitive structures 
came to correspond more closely to the content structure over the course of instruc- 
tion— not an unexpected finding, but an interesting measure. 

Egan & Greeno (36) found no differences in effectiveness of discovery and rule 
programs of teaching binomial probability, but found that individual differences in 
relevant abilities and knowledge were more predictive of success in learning by 
discovery than in learning by rule. Low ability subjects did better when instructed 
by rule. As in Mayer & Greeno’s research, the discovery method was superior in 
transfer to problems requiring more interpretation or relating to previous knowledge 
(“external connectedness” in Greeno’s terms). Learning by rule, however, was 
superior on familiar problems. Similarly, Goldman (58) found that psychology 
students using a logical strategy for learning statistics and experimental psychology 
did better on test scores and a research proposa! than those using a mnemonic 
Strategy. 

However, in a larger field study comparing three English junior schools, Richards 
& Bolton (128) found that the schools stressing the discovery method produced 
poorer achievemement in mathematics both on traditional tests and on a test of 
understanding than schools using a traditional or a combination of traditional and 
discovery approaches. One of the problems of applying treatment-outcome interac- 
tion research in the schools is the probability that a further interaction term 1s added 
—the teacher. 


Communication Style 


Even though most classrooms are characterized by a high level of teacher talking, 
relatively little research has been done on what the teacher says or how it is said. 
Smothergill, Olson & Moore (145) manipulated the amount of elaboration of chil- 
dren’s comments in a preschool. Positive effects of elaboration upon children’s 
verbal performance were found. Types of examples used by teachers, such as those 
with a racial reference, made no difference in learning of black and white college 
students in a study by Trumpeter (159). 

Despite the great current interest in the effects of models on children’s behavior, 
a gap remains in research on the effect of a teacher’s verbal and conceptual behavior 
on students. Moreover, the research on modeling has largely been focused at the 
younger ages. If models play an important part in development, college students 
should have well-learned skills in learning from models, yet, so far as I know, no 
one has studied the effects of different teacher models on cognitive outcomes of 
college courses or the variables which influence the extent to which students incor- 
porate a cognitive model. 


Questioning 


The insertion of questions in prose materials has been shown by a number of studies 
to facilitate learning. In two nicely controlled studies McGaw & Grotelueschen (97) 
investigated the direction and generality of this effect, finding that the questions 
facilitated performance through stimulating review of material preceding the ques- 
tion and through increasing attention to the material following the question. Boyd’s 
(18) results support this theory. 
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Other cues focusing attention or increasing meaningfulness can also aid learning. 
Johnson et al (73) found that adding diacritical marks to elementary school reading 
material improved performance both in reading (on nonmarked material) and spell- 
ing. These results on the effect of focusing attention fit with Rothkopf’s concept of 
mathemagenic activities (136), those activities that give birth to learning. Rothkoff’s 
concept grew out of his finding that knowledge of results does not affect learning 
directly but does affect inspection activities of students. 

Rothkopf (136) replicated the earlier finding that questions asked by a teacher 
during individual study were more effective than written questions. The later study 
showed that the facilitative effect was dependent upon text relevant questions and 
not social interaction. If, as Zajonc (169) suggests, social facilitation involves in- 
creased arousal, the effect of the arousal is apparently focused by the questions. 

Another theory relevant to the effect of questioning is Berlyne’s theory of epis- 
temic curiosity (13). In Berlyne’s original demonstration (14) subjects given a ques- 
tionnaire before reading statements with answers to the questions recalled more than 
subjects not given the questionnaire. Frick & Cofer (47) repeated the Berlyne study 
with a better control on the test items and obtained the same results. Watts & 
Anderson (163) found that questions requiring application of principles to new 
examples were more effective than those simply involving recall. 

The use of questions in reading material thus helps focus the attention of the 
learner and stimulates review and application. In addition, questions can create 
motivation by arousing curiosity. 


Testing and Grading 


One of the classic studies of educational research is McCluskey’s (96) finding that 
students studied more effectively for essay than for objective tests. This fits with the 
Watts & Anderson finding cited above if one assumes that students see objective tests 
as requiring recall more than application. Mellenbergh (103) failed to find McClus- 
key’s difference. Students given weekly quizzes requiring eighth graders to draw 
inferences were compared by McKenzie (100) to students given quizzes on recall 
of facts. He found the inference quizzes to be more effective in terms of ability to 
draw new inferences, with no loss in recall of facts, a result very comparable to that 
found by McCluskey and to King & Russell’s (78) finding that students instructed 
to study a passage for meaning remembered more verbatim details than a group 
instructed to memorize it. Thus here again we find that information processing 
strategies involving meaning and logic (as in Goldman’s study reported above) are 
more efficient than those relying upon memorization. 

Pack (118) compared the effects of norm-referenced vs criterion-referenced tests 
on student attitudes and found that criterion-referenced testing in which the student 
is not compared with others produced more favorable attitudes toward the subject 
matter. Veroff’s theory of the development of achievement motivation (160) would 
suggest that one should expect age-level differences in responses to competitive vs 
criterion-referenced standards of achievement. 

A neat, but probably not earthshaking, finding is that in grading essay tests, 
graders are affected by neatness in handwriting if they are themselves neat writers. 
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Graders whose own writing is messy are not biased by neatness (Huch & Bounds 
66). 


Pass-Fail Grading 


At the college level, arguments about feedback and reinforcement have largely 
centered around grading systems. One commonly used system is to allow students 
to elect one course on a pass-fail rather than graded basis. In one of the few 
experimental studies of legal education, Lempert (87) found that students did not 
achieve as well in their pass-fail course as a control group who had also wished to 
take the course pass-fail, nor as well as would have been expected in terms of 
previous performance. Moreover, the pass-fail students’ performance in graded 
courses did not improve. Similar results were obtained by Bronfenbrenner (19). Such 
results probably cannot be generalized to the situation where a// courses are graded 
pass-fail or pass-no entry. When a pass-fail course is competing for student time with 
graded courses, pass-fail loses out when tests or due dates for term papers in the 
graded courses become imminent. Thus lower achievement on graded examinations 
for Pass-Fail students is not surprising. The question remaining is, “Are there gains 
in attitudinal or other types of outcomes?” To date few data are available. 


Feedback, Reinforcement, Reward 


Reinforcement has long been believed to affect learning, and in fact some experi- 
menters define reinforcement as any condition that increases learning. Yet Bolles 
(16) and Lipe & Jung (90) review a variety of evidence indicating that contingent 
reinforcement is neither necessary nor sufficient for operant learning. I have fol- 
lowed Walker (162) and other theorists, e.g. Estes (37), in believing that some of 
the contradictory results in this area could be resolved by differentiating between 
the informational and motivational consequences of reinforcement. In this concep- 
tion informational feedback is particularly crucial in cognitive learning. But, as the 
following section indicates, even informational feedback turns out to be more com- 
plicated than can be expressed in a simple principle that knowledge of results aids 
learning. 


KNOWLEDGE OF RESULTS Programmed learning places much emphasis upon 
prompt feedback. Olson (113) found that giving college students knowledge of 
results on quiz items failed to produce generally better final performance than no 
knowledge of results, and Oner (117) also found no effect of feedback or praise in 
learning decimals in a programmed lesson. Moreover, feedback did not interact with 
anxiety in determining performance. Thus knowledge of results does not always aid 
learning. My guess is that the crucial factor is the amount of information given by 
knowledge of results and the learner’s knowledge of what to do to achieve a better 
outcome. In typical programmed learning units with low probability of errors, 
knowledge of results conveys little information. We might expect more positive 
effects of feedback in situations in which the task is more difficult and the informa- 
tion provided thus greater. 
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CONTIGUITY Sassenrath (142) has manipulated feedback delay and found no ad- 
vantage for immediate over delayed feedback. For those who know the classical 
nonsense and motor skill learning literature, such a finding may seem heretical, but 
studies of learning of meaningful verbal materials have not found that immediate 
feedback aided retention. Sturges (150) suggests that the advantage of delayed 
feedback found in these studies and in Sturges’ own research may be due to the 
subject’s exploration and organizaton of the material before feedback. 

Programmed instruction proponents were understandably aghast to find that 
immediate knowledge of the correct response (expected to be a reinforcer) did not 
facilitate learning in programmed instruction. Like Sturges (151), Anderson, Kul- 
havy & Andre (3) suspected that the reason this result occurred was that when 
knowledge of the correct result is immediately available, students become inatten- 
tive and careless in trying to answer the question. Thus they used a computer to give 
knowledge of correct response only after the student gave an answer. Results in their 
experiments tended to be positive. Their hypothesis was further supported by finding 
that a group allowed to peek at the correct response scored lower than students who 
had no feedback of the correct response. Feedback of correct response after the 
student’s response did not, however, produce superior performance on a delayed test 
of retention. 

Delayed feedback was superior to immediate feedback in the experiment of Kul- 
havy & Anderson (83), and their design provided support for the theory that the 
delay permits potentially interfering errors to be forgotten as well as contributing 
to greater attention to the feedback. 

These experiments casting doubt on the principle of contiguity suggest again that 
the importance of knowledge of results depends upon the information provided. 
Delayed knowledge of results may be more informative than immediate and may 
stimulate rehearsal. Delayed reinforcement should work as well as (or better than) 
immediate reinforcement if the relevant stimuli and responses are familiar and the 
subject can remember what he is being reinforced for. 

The fact that delayed feedback can be effective does not necessarily suggest that 
a teacher should delay returning test papers. Most of the studies reported herein 
were carried out with delays much shorter than those common in school and often 
with programmed materials in which feedback gives little information because 
students already know most of the answers before getting the feedback. In school 
situations one is likely to have many students who are genuinely confused or 
misinformed before receiving feedback. Delay in returning papers may be damaging 
to their motivation and may be interpreted as evidence of the teacher’s lack of 
commitment to the class. 


INFORMATIONAL FEEDBACK VS PRAISE VS TANGIBLE REWARDS Tangible rewards 
can improve learning (Frase 45, O’Leary & Drabman 111, Rosenfeld 133), but 
consequences of responses differ in their effects upon learning for different children. 
Informing a child of the correctness of his response increases learning for middle- 
class, normally achieving children, but nonmiddle-class or nonachieving children 
may learn more effectively when given praise or tangible rewards (Terrell, Durkin 
& Wesley, 157, Zigler & deLabry 172, Zigler & Kanzer 173). Blair (15) similarly 


INSTRUCTIONAL PSYCHOLOGY 185 


showed that normal achievers in the third grade learned more effectively for infor- 
mational feedback or praise than for tangible rewards (toys or candy); low achieving 
children learned better with tangible rewards. Cradler & Goodwin (30) found that 
in groups of second and sixth graders, sixth grade middle-class subjects were more 
responsive to praise and symbolic feedback while second grade lower-class children 
were more responsive to candy. Portuges & Feshbach (125) found that advantaged 
children, especially girls, were more likely than disadvantaged children to imitate 
a teacher model, especially if the teacher had been positively reinforcing to children. 
Since teachers are likely to emphasize symbolic and social rewards, differences in 
achievement between social classes may be due in some part to lack of effective 
rewards for learning and lack of teacher modeling by underachieving children. 


OTHER VARIABLES INFLUENCING THE EFFECT OF FEEDBACK In addition to interac- 
tion with student characteristics, the effect of feedback depends on the type of course 
and the type of feedback given. Levin (88) found that praise for achievement-related 
skills helped spelling, but not achievement in arithmetic or language for under- 
achieving elementary school children. Levin also found that praise for achievement 
was more effective than praise for effort. 

The content of feedback was also an important variable for Means & Means (102), 
who found that for low grade-point-average students, being told that they had done 
well on an aptitude test produced better performance on the midterm in an adoles- 
cent psychology course. For high grade-point-average students, being told that they 
had done poorly produced better performance. Such results remind one of Atkin- 
son’s theory that achievement motivation is highest when probabilities of success 
are moderate. 

Teachers can be learners too, and one presumed input to their learning to be more 
effective teachers is the use of student ratings of teaching. As in the case of feedback 
to students, feedback of student rating of teaching effectiveness to college instructors 
does not necessarily produce good effects. Miller (106) found positive results of such 
feedback in only one of three classes. Centra (21) also found no overall effect of 
feedback. He did, however, find that those teachers who rated themselves more 
favorably than their students did change (as perceived by students) in the direction 
suggested by the student ratings. Klein (79) found that teachers’ positive or nega- 
tively toned verbal behavior was influenced by pupil feedback (smiling, attentive- 
ness) in the classroom. Pambookian (119) found that feedback to instructors did not 
help those who were rated as highly effective or as ineffective but did help those rated 
as being moderately effective—again fitting Atkinson’s theory. 

Pambookian’s results fit nicely with those of Flook & Robinson (44), who found 
that among students who were told their intelligence and anxiety test scores, those 
in the middle range of intelligence were the only group to show superior achieve- 
ment during the freshman year as compared to a control group not told their scores. 
Hammer (61) showed that teacher comments on undergraduate physics papers had 
a positive effect on later performance and especially when the teacher slanted the 
comments depending upon the student’s expected grade. These comments, from the 
examples given, seem to have been likely to move probability of success to an 
intermediate level, thus, in Atkinson’s terms, increasing motivation. The effect of 
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such comments was particularly great for students high in external control on 
Rotter’s I-E Scale; presumably such students are more dependent upon external 
evaluation. 


PRINCIPLES OF LEARNING AND INSTRUCTION 


The past 2 years have been bad ones for those of us who attempt to apply traditional 
principles of learning to instruction. Thorndike’s principles of learning seems to be 
crumbling. As we have seen: 

1. Knowledge of results is not necessary for learning. 

2. Delayed knowledge of results may be more effective than immediate knowledge 
of results. 

3. Rewards seem not always to function to improve learning, and the effect depends 
upon the type of reward. 

4. Errors do not seem to persist as expected (Kulhavy & Parsons 84). 

5. Careful planning of a learning program may be no better than a random se- 
quence. 

6. Learning by a sequence of small steps may be less effective than learning by larger 
jumps. 

7. Defining objectives may not help improve student learning. 

In fact, each one of the principles confidently enunciated by Skinner in The 
Science of Learning and the Art of Teaching now turns out to be untrue—at least 
in as general a sense as he believed at that time. This does not mean that Skinner’s 
attempts to influence education have been bad or that the principles are false; rather 
the attempt to make a really systematic effort of application has revealed that what 
we psychologists took to be verities are principles that hold only under limited 
conditions. This does not mean that the search for principles of learning is fruitless. 
One implication is that the traditional principles of learning apply to particular 
kinds of learning and that we now need to apply the same theoretical and experimen- 
tal rigor to the study of principles of learning concepts and meanings. Moreover, 
attempts at application have forced us to confront the need for ways of classifying 
tasks and situations if we are to generalize from laboratory findings to other situa- 
tions (Fleishman 41, Gagné 50). 

Moreover, having demonstrated that the traditional principles of learning are not 
universally valid does not mean that they are valueless for instructional psychology. 

Knowledge of results, I would aver, is important for learning when the knowledge 
provides information and the learner knows how to correct his behavior; it doesn’t 
make much difference if the learner already has a pretty good idea of how well he 
has done or doesn’t know what to do differently. 

Contiguity probably makes a difference in learning if the learner soon forgets what 
he did, but it doesn’t help when he can easily retrieve it from memory and continue 
to think about it. 

Rewards probably do affect motivation for continuing learning, since from them 
the subject learns what activities lead to rewards, but extrinsic rewards which 
strengthen immediate motivation should not necessarily be expected to strengthen 
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motivation for choosing learning activities in the absence of anticipation of the 
reward. 

Planning and guidance of learning probably is useful within limits, but there may 
be too much as well as too little. 

Moreover, some traditional principles do stand up fairly well in this more com- 
plex domain. For example: (3) active participation is better than passive learning 
if active means something more than writing out an answer one already knows; 
(b) meaningful learning is better than rote learning. 

But the differences between the old and the new principles lies not only in better 
understanding of their limitations but also in a shift from an associationist, S-R, 
approach to an information processing approach. In such research as that on 
providing memory support to anxious students and on rote vs conceptual strategies 
for presenting content, we see how information processing models suggest analyses 
of instruction that have proved to be productive. 

Without researchers’ hardheaded studies of meaningful learning in the laboratory 
and in controlled natural settings, I could have continued making well-intentioned 
speeches to college teachers on the principles of learning. Now I believe that those 
principles apply most clearly to learning nonsense syllables or to the learning of 
animals in artificial situations. Meaningful learning is both more robust and more 
complex. But the complexity, so frustrating to those who wish to prescribe educa- 
tional methods, is a reminder of the fascinating uniqueness of the learner. Fortu- 
nately, most educational situations are interactive situations in which a developing, 
learning human being engages with a situation in ways designed to meet his learning 
needs. Part of that situation is another human being who has some resources for 
instruction and some capacity to adapt to the learner. The educational outcome of 
their interaction over time depends upon many variables. I was reared in psychology 
with the dictum that most important relationships are linear. I am now an agnostic 
on linearity. I can contemplate the possibility that much of the educational world 
is characterized by complex nonlinear relationships. In each section of this review 
we have dealt with interactions—attribute-treatment interactions (ATI), teacher- 
test interactions (TTD, method-outcome interactions, and the three-way and four- 
way interactions resulting from combinations of these. The complexity is enormous, 
but education would be sterile, boring, and inhumane if the complexity were not 
there. 

Do the limitations of our current work mean that instructional research has no 
useful role in education? Not so! On the one hand it provides a safeguard against 
easy oversimplifications; it can increase the instructor’s ability to see new dimensions 
and to make finer differentiations. On the other hand it can help the instructor to 
avoid being overwhelmed by the enormity of the complexity, by providing some 
ways to structure perceptions of the learner, the group, the instructional task, the 
situation, and the teacher-student relationship. Models of the instructional process 
may limit one’s awareness and ability to change, but properly conceived they can 
be tools to increase one’s capacity for perceiving and understanding the complexity 
of this very special changing human interaction. 
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The period covered by this review held much ferment and progress for the field 
of spatial vision. This review emphasizes those major trends which I believe ought 
to be encouraged if progress is to be both solid and rapid. What are those trends? 
First, we have seen an increased awareness of the complexity of the anatomy and 
physiology that support spatial vision. This increased awareness has brought not 
only new insights but also recognition of the deficiencies of earlier explanations for 
complex phenomena. A second major development has been the growing cross- 
fertilization between clinicians and nonclinicians who share a common professional 
interest in spatial vision. This interaction has been fostered in part by program and 
publication policies of the Association for Research in Vision and Ophthalmology 
(ARVO). A third encouraging sign is the successful application of linear systems 
analysis to various problems in spatial vision. This includes the use of Fourier 
analysis both as a means of describing stimuli and as part of a model of what the 
visual system may be doing. Related to this last trend is the growing awareness that 
many of the problems of spatial vision can be profitably attacked with paradigms 
and insights derived from work on other sensory systems, particularly the auditory. 
As Gouras (88) suggested, this collaboration may indeed work for the benefit of all 
neuroscience: “... the over-all microscopic similarity of all parts of neocortex 
implies that some fundamentally similar plan is being followed for feature-detecting 
circuits in all sensory and motor modalities, and it would be exciting if vision 
research could provide the crucial insight into this master design” (88, p. 3). This 
growing interaction among specialists in various modalities would surely have de- 
lighted von Békésy, Nobel prize winning student of vision, audition, touch, and 
taste, who constantly urged such interaction (197) and who passed away recently. 

Since this review’s bibliography makes no claims to be all-inclusive, it may be well 
to indicate the usefulness of certain bibliographic compilations that may not be 
familiar to all readers. The various parts of the Science Citation Index are to be 
found in most large science libraries. The components permit searches to be made 
in a variety of modes while the catholicity of its multidisciplinary data base matches 
that of spatial vision itself. On this very score, a traditional bibliographic source for 
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psychologists, Psychological Abstracts, is quite useless. Recently the Visual Science 
Information Center of the School of Optometry at Berkeley has begun to publish 
the Vision Index, drawing from a data base well suited to most interests in spatial 
vision. While users will find an occasional subject term a little odd, the Index is for 
the most part very valuable. Finally, the recent outstanding Annual Review chap- 
ters by Brown (41) and Leibowitz & Harvey (114) provide leads to some papers that 
readers might otherwise have missed. 


MAJOR BOOKS AND MONOGRAPHS 


During the period under review, there were three major publications of book or 
monograph size. The one that is likely to have the greatest impact, both on the 
literature and pocketbook, is the volume Visual Psychophysics (105) which Jame- 
son & Hurvich edited for the Handbook of Sensory Physiology. While its price is 
an outrage, a good deal of the sting is reduced by its contents and coptous illustrative 
material. Among the chapters that are of direct concern to the present review are 
those on dark and light adaptation (Barlow 9), signal detection theory (Nachmias 
138), luminance difference thresholds by Blackwell (19), and Aulhorn & Harms’ 
chapter on visual perimetry of all sorts (7), Heinemann’s review of work on simulta- 
neous brightness induction (96), and Westheimer’s thoughtful and provocative 
treatment of visual acuity (212). This last chapter, a useful extension of Wes- 
theimer’s Annual Review chapter on visual acuity of some years back (210), is nicely 
supplemented by Fiorentini’s chapter on phenomena associated with Mach bands 
(77). While not directly concerned with spatial vision per se, three other chapters 
in the Jameson & Hurvich volume will be of particular interest to students of vision: 
Kelly’s on flicker (109), Alpern’s on eye movements (1), and the chapter by L. Matin 
(130) on the relationship between eye movements and perceived visual direction. 
Riggs & Wooten (156) have summarized much work on electrical and psychophysi- 
cal data in humans, and their treatment of evoked responses can be nicely supple- 
mented by Regan’s encyclopedic book on this same topic (154). Finally, Enoch (70) 
provides an extraordinarily lucid presentation of the logic and assumptions that 
underlie Stiles’ two-color, increment threshold technique. Since this procedure or 
others that resemble it in intent, if not in fact, is of use in studies of nonchromatic 
stimuli in spatial vision, Enoch’s presentation is most welcome at this time. The 
whole volume, including the chapters on color vision which have not been cited 
here, provides a very useful addition to a vision researcher’s reference library partic- 
ularly since earlier volumes with which it might be compared seem to have aged 
rather badly (89, 113). The only quarrel I have with this volume is its uneven 
allocation of space. While some relatively minor topics are given unlimited chapter 
space, some major topics, like acuity, are held to about half the length they merit. 
In the near future we’ll see the companion volumes of the Handbook of Sensory 
Physiology as well as the multivolume Handbook of Perception, edited by Car- 
terette & Friedman. 

A second major publishing event is the appearance of the proceedings of an 
interdisciplinary conference on visual prostheses (Sterling et al 183). A visual pros- 
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thesis is a device designed to substitute for (or supplement) impaired vision. The 
rapid development of integrated circuit technology makes possible serious discus- 
sion (196) of implanted electrocortical visual prostheses driven by the output of a 
miniature television camera mounted in the orbit. Such an unusual prosthesis would 
take advantage of the phenomenon known as visual phosphenes (visual sensations 
elicited by electrical stimulation of the brain) to produce a usable cortical represen- 
tation of visual stimuli (163). Some of the book would read like intriguing, though 
not particularly well-written, science fiction were it not for the experiments of 
Brindley & Lewin (38), reprinted in that volume and amplified considerably by 
Brindley (37). Brindley & Lewin implanted a prototype visual phosphene prosthesis 
in the occipital cortex of a 52-year-old blind patient. When an electrode in the 
cortical matrix was energized the patient saw a small phosphene spatially localized 
in a position controlled by the electrode. Two important observations were made 
which seem to be critical if this phosphene procedure is to be used as the basis for 
a practical prosthesis: first, repeated stimulation of one electrode produced no 
habituation, and second, on each stimulation the phosphene was localized in the 
same place in the visual field. Brindley and Lewin were able to stimulate clusters 
of electrodes to produce recognizable phosphene “forms” (e.g. alphabetic charac- 
ters). The patient was easily able to identify these forms. Several of the contributions 
to the conference show that present technology could produce a useful implanted 
cortical phosphene prosthesis (Sterling & Vaughan 184, Vaughan & Schimmel 196, 
Marg 126, Schimmel & Vaughan 163). Strongly influenced by research on image 
enhancement and pattern recognition, several contributions to this volume argue the 
importance of digital preprocessing, filtering, or normalizing of the visual imput to 
the prosthetic device (e.g. Sterling & Weinkam 185). Finally, a variety of nonphos- 
phene prostheses are described, some using tactile inputs to substitute for vision 
(Bliss 27). Sprinkled throughout the reported interactions among speakers and 
discussants at the conference is considerable concern for the ethical implications and 
considerations of such research. 

Another major volume which appeared during this review period is Julesz’s 
Foundations of Cyclopean Perception (106). This beautifully produced and illus- 
trated monograph covers not only the phenomena of stereo-depth that have become 
identified with the author’s name, but also many other less well known aspects of 
Julesz’s work. These include new motion phenomena discovered by Julesz and his 
collaborators, novel aspects of form perception, the analysis of symmetry and clus- 
ters, and a general extension of what Julesz calls “psychoanatomy.” This term 
signifies the use of various experimental ploys to localize the site of various percep- 
tual effects in the human visual system. 

The book is an outstanding piece of work, provocative and impressive in its range. 
The monograph contains dozens of demonstrations, in the form of analglyphs, 
which permit the reader to see himself the phenomena Julesz describes. Most 
readers find that random-dot stereograms containing many depth planes require a 
considerable amount of time for the development of the stereo percept. For example, 
the “hyperbolic paraboloid with torus” or the multiturn spiral surface need to be 
viewed for many seconds before the figures fully emerge. What the visual system 
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is doing during this prolonged period of incubation is one of the great puzzles that 
Julesz tries to model. Among the questions that are largely unexplored is why, after 
viewing one of these multidepth plane random-dot stereograms, there seems to be 
considerable “savings” in the latency required to see another, different stereogram. 
In general, I believe that many of the traditional paradigms of learning (massed vs 
distributed practice, savings, transfer, forgetting etc) could be studied profitably 
with the Julesz random-dot stereograms. The Julesz patterns lie on the interface 
between memory and vision, as may the McCollough effect (121) and its very 
long-lived relatives. Such patterns may provide novel and important insights into 
the research domains that lie on each side of that interface. Nonstereo random-dot 
patterns are beginning to receive increased attention in clinical and applied contexts 
(Gorian et al 87, Cunningham & Galbraith 62), a trend that is likely to accelerate. 

The appearance in print of the Canberra Symposium on Vision within months of 
its occurrence presents a refreshing and fortunate precedent. The issues of Investiga- 
tive Ophthalmology that contain the proceedings are veritable gold mines of signifi- 
cant and current developments in vision, with an emphasis on physiology and 
anatomy but with a sprinkling of psychophysics as well (Day 64, Julesz 107, Over 
142, MacKay 123, Rushton 157). 


THE “GREENING” OF VISUAL PHYSIOLOGY 


Many of the Canberra papers on the anatomy and physiology of the mammalian 
visual system will be seen as a mixed blessing to those vision researchers who 
thought that Hubel & Wiesel (101, 102, 104) had provided at least the outline of 
a satisfactory understanding of the organization of the visual system. Just when the 
literature was on the threshold of assimilating the work of Hubel & Wiesel on 
receptive field organization, and just as nearly every psychophysical paper had at 
least one gratuitous reference to Hubel & Wiesel, we find that the gospel is in serious 
need of revision. The standard assumption made by those who use the Hubel & 
Wiesel hierarchy to “explain” spatial vision is that the visual system is primarily 
serial in its organization (Lindsay & Norman 116, Stent 181). This means that 
receptive field structure at the level of bipolar cells (Werblin 208) determines, via 
appropriate synaptic connections, receptive field structure at the level of the retinal 
ganglion cells (Cleland & Levick, 53), which are in turn the primary determinants 
of receptive field structure at the lateral geniculate nucleus (Levick, Cleland & 
Dubin 115). Somewhat closer to the focus of most students of spatial vision, the 
gospel proclaims that lateral geniculate cells begat simple cortical cells, which begat 
complex cortical cells, which begat hypercomplex cells (of Type I), which begat 
hypercomplex cells of Type II, etc. 

At the risk of repeating well-known arguments, it may be worthwhile to outline 
the implications of some of these relationships as they bear on spatial vision. In 
repeating them here, the emphasis will be on the deficiencies of the standard inter- 
pretation given such physiological data. To anticipate somewhat, we can classify 
these deficiencies into three categories: first, earlier treatments largely ignored the 
role of midbrain structures in spatial vision; second, there is a growing recognition 
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that the retinogeniculate pathway itself is not homogeneous—containing at least two 
major classes of subchannels, differing in latency, linearity, etc; third, there 1s a new 
emphasis on the potential of the visual system for parallel processing and informa- 
tion transfer. 


Hubel & Wiesel and Their Misuse 


Recordings from ganglion cells of the human retina (Weinstein, Hobson & Baker 
204) as well as recordings from the intact human visual cortex (Marg, Adams & 
Rutkin 127, Marg & Adams 128) suggest that responses in our own visual systems 
are not dissimilar to the responses at comparable levels of the visual systems of lower 
animals. Given this presumptive evidence of the general similarity of the human 
visual system to those of cat and monkey, let’s consider what Hubel & Wiesel say 
about the visual cortex. 

In their beautiful pioneering work, Hubel & Wiesel described the basic nature of 
synaptic organization of the geniculostriate pathways of cat (101, 102) and monkey 
(104). Simple cortical cells, presumably as a result of the nature of their connections 
to lateral geniculate cells, respond most strongly to a visual stimulus of a certain 
orientation and luminance sign (i.e. whether it is lighter or darker than the back- 
ground), and require that the stimulus be within a certain narrowly defined region 
in a frontoparallel projection. Simple cells have two other characteristics which are 
of interest. First, they respond to increasing length of stimulus with increasing 
response up to some critical length, beyond which response is invariant. Second, 
they respond to increasing width of a stimulus with increasing response up to a 
certain value, beyond which response may diminish. All of these characteristics may 
be developed in a straightforward manner from an understanding of the connections 
of the simple cell to the lateral geniculate cells from which it functionally rises. Such 
simple cortical cells have been termed “bar detectors,” “line detectors,” or “‘orienta- 
tion detectors,” and they have been treated by some psychologists as though “a 
single ‘simple’ cortical unit may signal for a rectangle of a certain orientation at or 
below a certain width and corresponding to a particular retinal locus” (Weisstein 
205, p. 159). Somewhat less spectacular claims have often been made for single cells 
of the Hubel & Wiesel type. For example, in a standard text on physiological 
psychology the following statement is found: “. . . the characteristics of stimuli that 
are effective in activating cells are those what are often said to be mediated by 
‘higher’ processes such as ‘perception.’ Perhaps the most fundamental importance 
of these experiments is that they remove much of the mystery from ‘perception.’ 
Abstract aspects of stimulus quality are coded by single nerve cells and do not 
require mysterious gestalt fields, a ‘scanning sensor,’ or a ‘man-within-a-man’ to do 
the perceiving” (Thompson 191, p. 351). 

While such cells may very well play a role in the analysis of orientation or other 
stimulus aspects, it must be pointed out that any one of these cells cannot by itself 
signal uniquely either orientation or the length of the stimulation line or any other 
property to which it responds differentially. The reason for this is obvious. Since the 
response of a cell is in fact the joint outcome of several stimulus dimensions, 
variation in any of them cannot be signaled unequivocally or represented by the 
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output of one simple cell. In other words, by judicious choice of stimulus values 
along all dimensions of the cell’s sensitivity it would be theoretically possible to have 
a large number of stimuli all adjusted to produce exactly the same response from 
the simple cell. Perhaps an analogy with color vision might reinforce this idea. 
Imagine a retina with only one photopigment type. The response of that photopig- 
ment is jointly determined by the number of quanta and the photopigment’s sen- 
sitivity to the wavelength of those quanta. We can produce exactly the same 
response in the photopigment by a combination of lots of quanta at a wavelength 
where the photopigment is not very sensitive (that 1s one at which the probability 
of a quantal absorption is low) or by stimulating with fewer quanta but at a 
wavelength to which the photopigment is more sensitive. Since its response is 
determined along two stimulus dimensions, we cannot derive wavelength related 
information from its output. We will return later to the implications of this point. 

The next stage of analysis in the cortex results in receptive fields that seem to be 
the product of synaptic interaction among several] simple cells. This stage of analysis 
is represented by the complex cell. Complex cortical cells show specificities for 
stimulus orientation and width approximately the same as those found in simple 
cells but differ from the latter in two major ways. First, complex cells show direction 
specificity—that, is a given cell may respond maximally to a stimulus moving in one 
direction and be totally unresponsive to movement in the opposite direction. Second, 
a complex cell seems less fussy about its x-y position requirements than is a simple 
cell. A complex cell will respond to an appropriately oriented line over a wide range 
of x-y positions. Such a lack of position specificity in the complex cell has frequently 
been used to explain the position invariance shown by various perceptual phenom- 
ena. Unlike simple cells, complex cell receptive fields did not appear to be composed 
of obvious on and off areas. Hubel & Wiesel went on to suggest that complex cells 
may be wired together to determine hypercomplex cell receptive field organization. 
Hypercomplex cells of Type I exhibit a novel property, namely an extreme sen- 
sitivity to length of line or stimulating bar. Such a cell will show increasing response 
as a function of the length of a properly positioned and oriented bar until a critical 
length is reached. Beyond this length, response drops, sometimes quite rapidly. This 
inhibition or “end stop” can be explained on the assumption of not too implausible 
presynaptic inputs from complex cells. An inhibitory input to the hypercomplex cell 
from a complex cell is adequate to explain the sharp drop in response as the stimulus 
is elongated beyond the excitatory region of the receptive field (Dreher 67). A second 
type of hypercomplex cell, perhaps deriving from Type I connections, shows sen- 
sitivity to stimulus angles. This rapid review of cortical organization ignores features 
that are of considerable importance to spatial vision. These features include the 
processing of retinal disparities (Pettigrew 148, Bishop & Henry 17) and the col- 
umnar organization so intensely characteristic of the visual and other sensory 
cortical volumes (24). 


Parallel and Serial Organization 


Because the hierarchical scheme described above may require some modification, 
a few remarks about it are in order here. First, it should be recognized that the 
existence of a hierarchy does not necessarily imply a strictly serial system. This 
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rather obvious point is often forgotten (as is pointed out in the discussion by Hubel 
and Stone in 186). The fact that receptive field organizations of all kinds are found 
in the axons which cross the corpus callosum (Hubel & Wiesel 103) indicates that 
there may be a good deal of parallel organization along with the basic hierarchy 
which Hubel & Wiesel describe. While the precise connections of these fibers that 
carry visual information between hemispheres is quite complicated, the implications 
of their receptive field types is more clear: information from cells of all types— 
simple, complex, etc.—is available to whatever centers are interested in such infor- 
mation. This fact is quite slow to impress students of spatial vision who have found 
it more convenient to pretend the system was not only hierarchical, but also serial. 

A related issue is raised by attempts to identify contrast analysis with the opera- 
tion of retinal ganglion cells. It has often been assumed by psychophysicists and 
physiologists that by the time the retinal image is coded into impulses along the optic 
nerve, that code reflects substantial analysis of spatial contrast. For example, simul- 
taneous contrast in man has been attributed to the spatially opponent organization 
of retinal ganglion cells. Observations by Maffei & Fiorentini (125) cast doubt on 
this thesis. They show that the inhibitory aspects of lateral geniculate cell receptive 
fields arise not from the inhibitory aspects of retinal ganglion cells (as had been 
thought) but rather from centers of retinal receptive fields, albeit different centers 
than give rise to the centers of the geniculate cell fields. They argue that the 
opponent organization at the retinal level is essentially a sensitivity control mecha- 
nism (see 208 for related arguments), and that the analogous opponent organization 
of the geniculate level does not arise from that of the preceding stage. Another 
important finding reported by Maffei & Fiorentini is that unlike their retinal gan- 
glion cell counterparts, the strength of the opponent organization at the geniculate 
is unaffected by dark adaptation. The implications of these results for psychophysics 
will be considered later. 

Bishop & Henry (18) expressed their dissatisfaction with two aspects of previous 
work on cortical receptive fields: first, as most readers will appreciate, there has been 
less than complete rigor and clarity in the manner of defining and classifying cell 
types; second, relatively little attempt has been made to describe the “intrinsic 
structure or regional organization within the receptive field” (18, p. 347), with most 
workers being satisfied merely to map the outlines of the field. This latter complaint 
has begun to be settled (Henry & Bishop 97), and there is yet hope that eventually 
we will know as much about the detailed regional organization of cortical cells as 
we know about that of retinal ganglion cells. Bishop & Henry (18) described their 
technique for plotting and classifying receptive fields with admirable clarity. While 
Bishop & Henry agree that the Hubel & Wiesel classificatory scheme is generally 
useful, they quarrel with certain details: first, they claim that fully three quarters 
of the simple cortical cells show directional selectivity in their response; second, they 
describe cells which are hypercomplex in that they show stopped-end inhibition but 
resemble simple cells in having distinct on and off subregions. These exchanges and 
disagreements among Hubel, Bishop & Henry in the Canberra Symposium (18, 97) 
give the reader a clear idea of the degree of agreement on the organization of the 
visual cortex. 
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Subchannels in the Primary Afferent Pathway 


The distinction that Enroth-Cugell & Robson (71) made between two distinct types 
of retinal ganglion cells has been extended to the lateral geniculate nucleus. Enroth- 
Cugell & Robson characterized retinal ganglion cells as X-type (linear summation 
. of center and surround, sustained responses, and slowly conducting axons) or as 
Y-type (nonlinear summation of center and surround, transient responses, and more 
rapidly conducting axons). Stone & Hoffman (187) and Cleland, Dubin & Levick 
(54) showed that X- and Y-type retinal ganglion cells tend to drive separate cells 
in the lateral geniculate of cat. 

Moreover, such X- and Y-type lateral geniculate cells may constitute separate 
geniculostriate channels (differing in latency, for example) and perhaps even have 
different cortical cells subgroups as their target (Stone 186). Stone has argued, in 
addition, that complex cortical cells may derive primarily from the Y channel while 
simple and hypercomplex cells derive from the X channel. Certain properties of the 
receptive fields of the cortical cells in question are consistent with this speculation. 
This controversial argument is further complicated by Hoffman’s (100) claims that 
X-type retinal ganglion cells do not project to the superior colliculus either directly 
or indirectly, but that Y-type cells do, after relaying through geniculate and cortex. 
While the value of this carefully preserved segregation of visual information into 
separate subchannels remains to be discovered, it is well to remember that neither 
all retinal ganglion cells nor all cells of the lateral geniculate nucleus are likely to 
prove equal. 


Contributions of the Superior Colliculus 


Several years ago it might nearly have been possible to draw a decent diagram of 
the main afferent pathways in the mammalian visual system (3) without lifting one’s 
pencil from the paper and (b) without recrossing very many lines. The recognition 
that the colliculus is a structure to be reckoned with in the scheme of things 
(McIlwain 122, Gordon 86) makes this feat no longer possible. Introducing his 
survey of the anatomy of the superior colliculus, Lund described the colliculi as 
“concerned with aspects of visual attention and centering of the visual image on the 
retina” (119, p. 435). His paper reminds us that colliculi are anatomically (Sprague 
179) and functionally heterogeneous structures. Virtually every investigator of the 
superior colliculus has found it useful to distinguish (at very minimum) between 
“superficial” and “deep” laminar divisions. Roughly speaking the superficial layers 
are concerned with strictly visual matters, receiving inputs from both contra- and 
ipsilateral retinas as well as from ipsilateral visual cortex. Its outputs go to several 
places in the visual system. Inputs to the less superficial regions of the colliculi carry 
information from several nonvisual modalities, from the superficial region of the 
ipsilateral colliculus, as well as from the contralateral colliculus. Gordon (86) sum- 
marized her work and that of her collaborator Sterling (Wickelgren, 214; Wick- 
elgren & Sterling, 215) on both superficial and deep aspects of the colliculus. The 
duplication of retinotopic maps in midbrain and cortical structures presents a great 
puzzle to the casual observer, a puzzle which is not lessened by the probability of 
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species differences in collicular function (Schiller 162). Cells in the superior col- 
liculus are less fussy in their stimulus requirements than are cortical cells. For 
example, the sharp tuning to direction of movement or stimulus orientation, charac- 
teristic of so many cortical cells, seems to be absent in the colliculus. Studies of 
cortical ablation suggest that at least two visual receptive field characteristics of 
collicular cells, their binocular sensitivity and large receptive field diameters, derives 
from corticotectal rather than retinotectal influences. There is a general agreement 
on the point that the superior colliculus plays a role in target acquisition and 
tracking (162) but growing disagreement on the details of this formulation. Wurtz 
& Goldberg, for example, write “we think the colliculus is involved in a shift of 
attention and facilitation of movement to an area, but not the coding of the exact 
location of the eye movement target for two sets of reasons: one is that our monkeys 
with lesions can acquire the target perfectly, but only take longer to start. The other 
is that properties of the cells are more consistent with an attention-shifting role than 
with a fine-guidance role (218, p. 595).” Goldberg and Wurtz (85) have provided 
a superb model of how one might study as complex a phenomenon as “attention” 
in a neurophysiological preparation and make a good case for the role of the 
colliculus in shifts of attention. Their discussion of difficulties in operationalizing 
“attention” will prove useful to psychologists who have in general only begun to deal 
with the implications of such phenomena for spatial vision (68, 170). 

Some of Gordon’s intriguing observations on the superficial layers of cat superior 
colliculus (86) are worth mentioning in detail despite the fact that some of those 
observations are yet to be confirmed in higher species (Goldberg & Wurtz 84). One 
of the striking characteristics of those superior colliculus cells that exhibit direc- 
tional sensitivity is the anisotropic character of that sensitivity. In general, such cells 
tend to be more responsive to movement outward from the center of the field 
(centrifugal) than to movement inward from the periphery (centripetal). This direc- 
tion preference, Gordon has suggested, helps the colliculus determine “ ‘where the 
stimulus is and where it is going’ in order to bring the stimulus into the center of 
the visual field and keep it there.” Recording from deeper layers of the cat superior 
colliculus, Gordon found striking evidence that a cell, which could be driven either 
by visual stimuli or by auditory stimuli, had its two receptive fields (auditory and 
visual) in the same area of perceptual space. The colliculus then has not only a 
topographic map of visual space, but also a topographic map of auditory space and 
a given cell monitors roughly the same area of space in both modalities. Moreover, 
directionally selective cells in these deeper collicular layers tended to have the same 
direction preference in both input modalities. Such cells, with analogous preferences 
in each input modality, might explain a variety of puzzling phenomena. For exam- 
ple, Wertheimer (209) noted that within minutes after birth a human infant seemed 
to have a workable, crude common map of visual and auditory space. While there 
might not be a direct causal relationship between the bimodal! superior colliculus 
cells which Gordon described and any human behavioral phenomena, it is clear that 
there is increasing interest in bimodal effects in spatial vision (Engel & Dougherty 
69, Day 65, Morrell 134, Dougherty, Jones & Engel 66). We may be approaching 
the point where it will no longer be premature (Stent 182) to explore bimodal effects 
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in spatial vision. While engaging in this wild speculation, it might be appropriate 
to suggest that the psychophysical literature already contains reports of differential 
centrifugal and centripetal motion sensitivity (Scott, Lavender, McWhirt & Powell 
165), not unlike that which Gordon describes for collicular cells. These data are 
consistent with Richards’ controversial claim that certain aspects of human motion 
perception are mediated by the superior colliculus (Richards 155). Even though 
such cells may actually play no role whatever in human spatial vision, they do 
provide a framework within which certain experimental manipulations make sense 
and without which they would be unthinkable. 


Psychophysical Linking Hypotheses 


Clearly, one of the recurrent themes in the study of spatial vision has been an interest 
in correlating observations made at different levels of analysis. Whether these corre- 
lations are attempted at the level of the inner plexiform layer of the retina or at the 
level of the hypercomplex cells (Nakayama & Roberts 140), some rules of transla- 
tion between the levels must be assumed. The rules that relate visual or perceptual 
events to their physiological correlates are psychophysical linking hypotheses. 
Brindiey (36) has written of the need and difficulty involved in making such rules 
or assumptions explicit. As one goes through the literature on spatial vision one is 
struck time and again by the failure to make and test explicit and well formed 
psychophysical linking hypotheses. Since such hypotheses are critical for the ad- 
vancement of spatial vision, it may be worthwhile to consider them and their 
implications. 

Brindley proposed that there may at present be only one generally accepted 
psychophysical linking hypothesis: “Whenever two stimuli cause physically indis- 
tinguishable signals to be sent from the sense organs to the brain, the sensations 
produced by these stimuli, as reported by the subject in words, symbols or actions, 
must also be indistinguishable” (p. 133). By accepting this argument we can make 
statements about the indistinguishability of stimuli a and 6 under conditions x and 
y. Brindley also proposed that we may adopt without question the converse of this 
proposition: physiological events which are distinguishable from one another give 
rise to different perceptual effects. Observations which contain statements about the 
distinguishability or indistinguishability of sensations are the constituents of Class 
A experiments. Such experiments include matching and threshold experiments of 
all kinds. Observations that cannot be expressed as the identity or nonidentity of 
sensations are Class B observations. These include descriptions of entoptic phenom- 
ena, response time experiments, scaling experiments, etc. 

Because Brindley recognizes the validity of only the one general psychophysical 
linking hypothesis described above, he has proposed that Class A experiments must 
necessarily be more informative of the underlying physiology than are Class B 
experiments. Other people have from time to time proposed sets of linking hypothe- 
ses, including Boynton & Onley’s (31) useful elaboration of Brindley’s scheme. Some 
time ago, Boring (29) presented an operational form of G. E. Mueller’s psychophysi- 
cal axioms which have considerable merit and provide a good companion to Brind- 
ley’s argument. Among the points which Boring made is that the possibility of finite 


SPATIAL VISION 205 


“just noticeable differences” on the sensory continuum make it somewhat risky to 
accept without question the proposition that different physiological events neces- 
sarily give rise to behaviorally defined Just noticeable differences. 

Brindley has proposed that it is possible to convert many Type B observations 
(e.g. the Purkinje shift) into equivalent Type A observations (e.g. thresholds for light 
of various wavelengths as a function of adaptation level, photopic through scotopic). 
While this may very well be true, the rules by which one translates from Class B 
to Class A are not always clear. For example, Brindley suggested (36, p. 146) the 
waterfall illusion (motion aftereffect) is a Class B counterpart of observations made 
by Pantle & Sekuler (145) at threshold level. Pantle & Sekuler, extending the earlier 
observations of Sekuler & Ganz, found that viewing contours which move in one 
direction selectively raises the luminance threshold for contours which move in that 
same direction. Brindley’s assertion that these threshold observations are Class A 
equivalents of the waterfall illusion is probably wrong. Sekuler, Lehr, Stone & Wolf 
(168) showed that the threshold effect of viewing moving contours behaves quite 
differently from the suprathreshold effect (waterfall illusion) under certain condi- 
tions of stimulation. 

This implies that we must be careful in extrapolating from threshold observations 
to suprathreshold ones, and that it is not always possible in an a priori fashion to 
specify the Class A counterpart of any given Class B observation. The following 
example might give additional insight into why this is so. Consider a single, simple 
cortical cell of the kind described by Hubel & Wiesel. Its response is determined by 
several stimulus dimensions. Such cells cannot singly signal unequivocally the orien- 
tation of a stimulus in its receptive field (or, for that matter, any of the other 
properties to which it is sensitive). A given response level from one of these simple 
cells could be produced by any of an infinite number of combinations of stimulus 
orientation, length, width, etc. This means that the suprathreshold appearance of 
stimuli, insofar as simple cells contribute at all, must be determined by the pattern 
of cross-cell activity (cf. Erickson 72), by the pooled or summed activity levels of 
very many cells. On the other hand, it is possible to assume that the detection 
threshold for a given target is controlled by one, or very few, cells—the one or ones 
that would be optimally stimulated by the particular combination of stimulus prop- 
erties possessed by the target. This conception owes an obvious debt to the domain 
of color vision. While the suprathreshold appearance (color) of a stimulus is the 
product of the pooled, weighted response of several wavelength sensitive channels, 
the detection threshold for a given target is determined by that channel which is 
most sensitive, given the prevailing combination of test wavelength, background 
wavelength and luminance (70). In this vein, then, Sekuler, Lehr, Stone & Wolf 
(168) have argued that while the detection threshold for contours moving downward 
through the field is determined exclusively by mechanisms sensitive to downward 
motion, the suprathreshold appearance of a target’s direction of motion is jointly 
controlled by several mechanisms, each sensitive to a different range of directions. 
One of the key unsettled questions of spatial vision then is the relationship between 
threshold and suprathreshold observations. Before we can establish clear rules for 
psychophysical linking hypotheses, we must be in a position to make some general 
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statements about the relationship between the levels of psychophysical observations. 
This question is critical for any attempt to make predictions of psychophysical 
observations from the often scant data at the physiological level. Almost alone 
among neurophysiologists, Hirsch (98) has considered this problem. 


ANALYSIS OF SPATIAL FREQUENCY RESPONSES 


A promising new approach to spatial vision is one that makes use of spatial fre- 
quency analysis. Westheimer (213) paints an amusing, but highly accurate portrait 
of subjects across the world being exposed to a staggering variety of spatial, tempo- 
ral, and even spatio-temporal sinusoidal modulations. Sinusoidal luminance distri- 
butions (gratings) have gained great popularity. Two questions clearly need 
answering: why are such odd stimuli being used and what sort of information does 
their use give us about spatial vision? 

As we will see, not all experiments which use these sinusoidal stimuli make the 
same level of theoretical commitment to those stimuli. The spatial response of the 
visual system is sometimes measured with sinusoidal inputs because the experi- 
menter assumes the visual system to be linear, at least near threshold. This means 
that the response is proportional to the input. More formally, linearity implies a 
system that obeys the principle of superposition-——if the response of the system to 
stimulus f;(x) is A(x) and the response of the system to stimulus g;(x) is 2,(%), 
then the response to the combination of A(x) + g;(x) is f(x) + g(x) for all pairs 
of f;(x) and g;(x), where / subscripts an input function defined over space and o 
subscripts an output function defined over space. A second assumption made im- 
plicity in most studies of spatial frequency response is that the system is space- 
invariant or homogeneous. This means simply that regardless of where on the input 
surface the input function is directed, the relationship between output and input will 
be invariant. Symbolically, we may state that a system is homogeneous if for all pairs 
of input positions, x, and x, £(x,) = £(4,) whenever f(x,) = f(x). A third related 
assumption, isotropy, is discussed by Cornsweet (58). Since, of the three assump- 
tions just described, isotropy is the one best met by the visual system it will be 


ignored in what follows. 
If the visual system’s spatial response can be considered linear and homogeneous, 


sinusoidal inputs are eigenfunctions for the system, i.e. the response of the system 
to a sinusoidal input will be a sinusoid of the same frequency but, perhaps, of 
different amplitude or phase or both (177). While many experiments have shown 
this assumption of linearity to be not unreasonable, for small signals at least, it must 
always be remembered that in reality the visual system is strongly nonlinear— 
possessed as it is of both threshold and saturation points. 

Sinusoids defined over space are useful in the direct, empirical generation of 
contrast sensitivity functions. These functions characterize the ability of the visual 
system to transfer information at various spatial frequencies from stimulus input to 
output. Similar plots are often found in popular photography or hi-fi magazines. The 
plot for an amplifier may relate output to input with input frequency, in the time 
domain, as a variable. In vision, the analogous plots display sensitivity as a function 
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of input frequency in the space time, i.e. spatial frequency expressed usually as 
cycles/degree. Sensitivity is defined as the reciprocal of percent threshold modula- 
tion—difference between peaks and troughs—required for the observer to distin- 
guish the stimulus from a uniform field of equivalent space-average luminance. 

There are two main uses for this spatial contrast sensitivity function, and they 
differ very much in their underlying theoretical assumptions. First, the function may 
be a quick and useful summary of the overall response of the visual system. As we'll 
see in a moment it is often important to have such a summary—much as it may be 
important to have the analogous information on a loudspeaker before purchasing 
it. Second, if the visual system were truly linear we could use the contrast sensitivity 
function to predict the response of that system to any arbitrarily chosen spatial input 
—including, perhaps, the Union Jack (213). As we will also see, experiments have 
been done which attempt to predict, from the contrast sensitivity function, how the 
visual system will respond to nonsinusoidal, small signal stimuli. To the extent that 
they are successful, we may consider the assumption of linearity not to be too far 
off. Useful descriptions of the process involved in these predictions are available (75, 
216). 


Descriptive Uses of Contrast Sensitivity Functions 


Recent years have seen the increasing use of contrast sensitivity functions to summa- 
rize not only the response of the whole eye-brain system, but of selected portions 
of that system as well. Campbell & Green (45) produced separate contrast sensitivity 
functions for (a) the optics of the eye (b) the remainder of the visual system, 
including neural structures. Ohzu & Enoch (141) produced similar functions for the 
retina of the human eye. Later, the use of the contrast sensitivity function in clinical 
contexts will be discussed (28, 92). Three examples here will provide some indication 
of the diversity of possible uses for this function. Berkley & Watkins (15, 16) used 
evoked potentials to estimate the acuity limits of the cat. Data relating magnitude 
of evoked potential to the spatial frequency of the evoking stimulus are of great 
importance in bridging the gap between single cell recording and behavior in the 
cat. Related data on the human evoked potential were reported by Campbell & 
Kulikowski (47). Finally, Shlaer (173) gave an elegant description of the optical 
transfer characteristics of the eye of the eagle, arguing that some of the incredible 
feats of acuity claimed by eagleophiles are really impossible. 


Linear Systems Uses of Contrast Sensitivity Functions 


This may be the appropriate place to repeat Kelly’s cautionary note on the role of 
Fourier analysis in the study of flicker (109), observing simply that the same caveat 
certainly holds for the case of spatial vision and spatial frequency analysis. Kelly 
pointed out that “Despite its importance, mathematical analysis of the stimulus 
alone does not constitute a theory. Some current investigators seem to regard 
Fourier analysis itself as a model of the visual process (which they call ‘the Fourier 
model’), but this is erroneous. Fourier analysis has the same status as vector analy- 
sis, matrix analysis, or any other branch of mathematics; it is one of many tools that 
are useful for constructing models. But there is usually more to constructing a good 
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theory than Just using the right tools” (109, p. 274). While Kelly’s point is well 
taken, it should be made stronger by a reminder that any application of Fourier 
methods necessarily involves assumptions about the system being analyzed. 

One particularly good example of Kelly’s argument is the study by Campbell, 
Carpenter & Levinson (43), using the techniques of linear systems analysis to predict 
the visibility of various aperiodic stimuli from knowledge of the overall, system 
contrast sensitivity function. The contrast sensitivity function was used to predict 
the visibility of a single bar (one-half cycle of a sinusoidal grating), of a double bar 
(a full cycle of the grating) and of a target in which the grating was restricted to 
one half the field while the other half was uniform and at the equivalent space 
average luminance. If we assume that the visual system, at least near threshold, is 
a linear system, one which obeys the principle of superposition, we can determine 
its response to any arbitrary stimulus by summing its responses to appropriately 
chosen components of that arbitrary stimulus. 

The line spread function is the image of a line object. This spread function, in the 
distance domain, and the contrast sensitivity function, in the frequency domain, are 
related by a Fourier transform (152). Campbell, Carpenter & Levinson (43) com- 
puted the line spread function from the Fourier cosine transform of the contrast 
sensitivity function. Since both variables are defined on the spatial domain, we may 
convolve the line spread function with the luminance profile of the stimulus, yielding 
the response of the system, also in the spatial domain. The response of the detector 
is assumed to be a function of the maximum peak-to-trough amplitude difference 
in the output of the convolution. This operation is repeated for various frequencies 
of the aperiodic stimulus. It is then possible to determine, for any aperiodic stimulus 
used, the expected output of the visual system as a function of the input frequency. 
After completing these computations, Campbell et al had only two emendation 
(“fudge”) factors available for fitting the experimental observations—one a constant 
associated with a given subject’s sensitivity as a detector of peak-to-trough ampli- 
tudes, and the other a constant which slid the predictions for a given subject along 
the x, spatial frequency axis. This study is one in which both the failures and 
successes of the theoretical predictions are informative. The predictions were quite 
good when two conditions were met: first, the data from the double bar conditions 
were excluded, and second, data primarily for higher spatial frequencies, i.e. greater 
than 10 c/degrees were considered. The failure in the case of the double bar stimulus 
(one full cycle of a grating) means that linear theory has to be “false in the extreme 
case of two single bars of opposite contrast, separated by considerable distance” (43, 
p. 296). At low spatial frequencies, with large distances between peaks and troughs 
of the convolution output, the assumptions of the model would be up against the 
obvious fact that the human retina is functionally organized into isoplanatism 
patches, finite regions each characterized by a somewhat different spread function 
(cf. Harvey & Pöppel 95, Van Doorn, Koenderink & Bouman 195). Westheimer has 
put this especially well in his arguments that “the heterogeneity of retinal structure 
makes the analysis of spatial visual function with extended sinusoidal targets some- 
what risky” (212, p. 183). The spatial grain of the retina varies perhaps by as much 
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as a factor of 10 as one goes from the fovea to the point only 10° peripheral 
(Westheimer 213). 

Even if the retina were not heterogeneous with respect to characteristic local 
spread functions, the visual system may lack the mechanisms required to detect 
differences between widely separated peaks and troughs in the retinal image. To sum 
up, while these experiments (43) make elegant use of some of the tools of Fourier 
analysis and linear systems analysis, they do not test, in the study under consider- 
ation, the possibility that the visual system operates on the Fourier components of 
the aperiodic targets (cf. Sullivan, Georgeson & Oatley 190). Paraphrasing Kelly 
rather imperfectly, this study uses the tools of Fourier analysis but does not consti- 
tute evidence for a Fourier theory. 


Multiple-Channel Models 


What sorts of theoretical interpretations can be given to the human spatial sen- 
sitivity function? What sorts of mechanisms could give rise to it? Westheimer points 
out that this standard, human contrast sensitivity function can be derived from the 
convolution of sinusoidal spatial inputs with the standard “Mexican hat” receptive 
field model profile with an excitatory center diameter of about 3 minutes (212, p. 
182). Under a variety of conditions the curve shows that the visual system has peak 
sensitivity at intermediate spatial frequencies, with decreasing sensitivity for either 
higher or lower frequencies. There are two general classes of models proposed to 
explain the function. The first, a single-channel model, considers the information 
from receptive field profiles of all sizes to be pooled prior to mechanisms which 
detect the target, say a grating. Evidence from selective adaptation experiments (e.g. 
144) supports quite a different interpretation, a multiple-channel model. This model 
assumes the contrast sensitivity function to be the envelope of sensitivities for a 
whole series (number unspecified) of separate channels, each sensitive to a limited 
region of the spectrum. If the outputs of these various channels were detected or 
measured in a relatively independent fashion, we would have one generic form of 
a multiple-channel model of spatial vision. An analogy to color vision might be 
useful. The overall sensitivity envelope of the photopic system coexists with—and 
presumably is produced by--some combination of three color sensitive channels. 
The operation of the individual channels can be measured if one goes to certain 
experimental lengths, using, for example, Stiles’ increment threshold procedure 
(70). Let us note one final point before going on to consider the multiple-channel 
model as it has developed over the past few years in spatial vision. A multiple- 
channel model does not necessarily imply that under all stimulus levels and for all 
tasks the visual system behaves as though composed of totally independent spatial 
frequency channels. To anticipate somewhat a point made at length later, indepen- 
dence of spatial frequency channels seems to hold only near threshold (with low 
contrast inputs) and there is increasing evidence of theoretically interesting inhibi- 
tory cross-talk between channels sensitive to somewhat different frequency bands as 
the responses of those individual channels increase. Moreover, the importance of 
task variables in spatial frequency analysis is just beginning to be examined (50). 
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What is the likely neural origin of these spatial frequency selective channels? This 
is a question that has recently been reopened. It has been commonly assumed that 
the first major stage of spatial contrast analysis was carried by the outputs of retinal 
ganglion ceils. Several observations reported during the review period seem to raise 
questions about this assumption. From a study of connections between retinal 
ganglion cells and their target cells in the lateral geniculate nucleus, Maffei & 
Fiorentini (125) have suggested that retinal ganglion cells, at least in cat, are not 
passing contrast information along to the geniculate cells. Moreover, two psycho- 
physical studies indicated that scotopic spatial adaptation levels do not affect sen- 
sitivity, as one would expect from current ideas about the reduction in retinal 
ganglion cell surround inhibition with scotopic conditions (90). As Fiorentini & 
Maffei put it, “To whatever extent perceptual contrast effects can be explained in 
terms of receptive field organization, they cannot be accounted for by the properties 
of retinal receptive fields. The properties of more central visual receptive fields such 
as those of the lateral geniculate body or cortex seem to account better for the 
observed scotopic contrast phenomena” (80, p. 78-79). 

A model of the visual system, if it is to be of much use, has to deal with more 
than the response to simple sinusoidal gratings. How does a multiple-channel model 
deal with more complicated stimuli—real world scenes for example? The model 
postulates that the visual system decomposes any scene into its spatial frequency 
components. Each component is then detected in a manner that is independent of 
the others (159). The general finding, at threshold, is that components of a stimulus 
which differ by as much as a factor of two in spatial frequency are detected quite 
independently of one another. For example, a complex grating (one consisting of 
more than one spatial frequency component) is detected—discriminated from a 
homogeneous field—-when the contrast of one of its spatial frequency components 
is above threshold. Imagine that we measure the threshold for a simple sinusoidal 
grating of frequency F and then measure the threshold for another simple sinusoidal 
grating of 2F frequency. Suppose further that because of their relative positions on 
the contrast sensitivity function the F grating had the lower threshold of the two. 
If we then create a compound grating with both E and 2F' components, regardless 
of their phase relationships, and measure the contrast threshold for the compound 
(again, the contrast at which the grating is discriminably different from a homoge- 
neous field of equivalent mean luminance) that threshold is numerically identical 
with the point at which the contrast of the F component itself reached threshold. 

In summary, it is important to recognize that models of spatial vision differ along 
a number of relatively independent dimensions. For example, a model may make 
assumptions about the linearity of the system being described. A model can be 
defined as multiple channel or single channel. Multiple-channel models can incorpo- 
rate various degrees of cross-talk among channels; the channels, as well as the 
cross-talk, may be either linear or nonlinear. 


Fourier Models 


An important distinction needs to be made between the multiple-channel model and 
a stronger version of that model, which claims not only a spatial frequency analysis 
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by independent channels each sensitive to particular spatial frequency band, but 
which claims that the visual system uses a particular kind of spatial analysis in this 
operation, an analysis into its Fourier components (Pollen, Lee & Taylor 151). This 
strong Fourier version of the multiple-channel model is not logically mandated by 
the findings concerned with the independence of various spatial frequency channels. 
The multiple-channel model or some version of it could be correct even if the visual 
system did not perform a set of operations isomorphic with Fourier analysis. Still 
there are many reasons for using this outlandish notion of a Fourier model as a 
working assumption. 

Fourier theory states simply that any arbitrary waveform can be analyzed into 
appropriate sine and cosine components and that the reverse process is also possible: 
that any arbitrary waveform can be approximated or synthesized from appropriate 
components. In spatial vision it is commonly assumed that retinal ganglion cells or 
cells such as simple cortical cells perform a hardware analysis whose output can be 
thought of as giving the power in each of a number of spatial frequency bands. 
Although a single-channel Fourier model of spatial frequency is logically possible 
it is easily shown to be false (Pantle & Sekuler 144). Consequently the only Fourier 
model well consider is the standard, multiple-channel Fourier model. 

Regardless of its ultimate correctness, this point of view does have several attrac- 
tive aspects: first, it serves to emphasize what may be heuristically significant simi- 
larities between spatial vision and audition (Carterette, Friedman & Lovell 52, Kay 
& Matthews 108, Stromeyer & Julesz 189, Carter & Henning 51). Second, it implies 
a useful way of quantifying certain kinds of stimuli, particularly spatially complex 
ones. The addition of certain auxiliary assumptions permits prediction of indepen- 
dence, additivity or masking interaction among stimuli. It is these auxiliary assump- 
tions that constitute a theory, not the manner in which the stimuli are described 
(109). Third, the idea of multiple and independent channels for the analysis of 
various bands of spatial frequency information fits nicely with the work of Enroth- 
Cugell and her collaborators on the sensitivity profiles of cat retinal ganglion cells, 
lateral geniculate, and cortical receptive fields (e.g. Campbell, Cleland, Cooper & 
Enroth-Cugell 44), Since the contrast sensitivity function is central to so many 
experiments testing a Fourier model it is nice that the function is easily derivable 
from first principles. Fourth, the assumption that the visual system performs some 
kind of Fourier analysis of its retinal image input provides a convenient way of 
explaining certain sticky perceptual phenomena. For example, if only the distribu- 
tion of power in various spatial frequency channels is considered, the representations 
of the stimulus patterns would be invariant over shifts in phase or with translational 
changes (Rybicki, Tracy & Pollen 158, Armstrong & Sekuler 4). This would also 
provide a ready-made explanation for the fact that we can recognize objects despite 
changes in their location in the visual field from presentation to presentation. 
Finally, the Fourier version of the multiple-channel model permits the prediction 
of a variety of nonintuitive visual phenomena (Blakemore, Nachmias & Sutton 25). 
These successful predictions represent the major achievements of the model and 
some can be profitably summarized here, followed by an indication of possible 
problems with the formulation. 
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Campbell & Robson (49) gave an explicit statement of the Fourier multiple- 
channel model. While not all their findings unambiguously require such a model, 
some being well handled by a single-channel model, other of the data do. From the 
Fourier representation of various nonsinusoidal gratings (e.g. square wave grating), 
Campbell & Robson could predict the detectability of the grating; these predictions 
required only knowledge of the contrast sensitivity function. These results merely 
show that the system is approximately linear and do not mandate the independent 
detection of various spatial frequency components, a key part of a multiple-channel 
model. By Fourier theory, a square wave grating may be treated as a combination 
of a fundamental frequency F and modulation 4/77, a third harmonic, 3/ with 
modulation 4/37r, a fifth harmonic, SF with modulation 4/57r, etc. When such a 
square wave grating was just detectable it appeared sinusoidal. Additional contrast 
was needed for it to appear square. Campbell & Robson found that from data on 
the detectability of simple sinusoidal gratings of F and 34, they could predict the 
contrast of the square wave grating at which it would first appear square rather than 
sinusoidal. This finding does seem to require the independent detection of various 
frequency components, supporting a multiple-channel model. 

As further evidence for the model, Blakemore & Campbell (20) showed that 
exposure to a high contrast square wave grating of frequency F raises the threshold 
for sinusoidal gratings of frequency F and that for gratings of 3F as well (although 
to a lesser degree, as predicted). This latter finding is expected if the visual system 
were doing a Fourier analysis on the square wave grating, decomposing it into its 
fundamental and odd harmonics. Of the odd harmonics only 3F would have ade- 
quate power to produce detectable adaptation. It should be noted that many papers 
that support a multiple-channel model (e.g. 144) find that predictions from the 
Fourier position fail at low spatial frequencies. It may well be that while the Fourier 
multiple-channel model is fine for higher frequencies, the visual system has adapted 
an alternative approach to the analysis of low frequency components of the retinal 
image. 

More recent papers have been aimed directly at the Fourier multiple-channel 
model. Sullivan, Georgeson & Oatley (190) measured the contrast threshold for 
single bars, of varying width, as well as for sinusoidal gratings of various spatial 
frequencies. Measurements were made with and without prior adaptation to grat- 
ings (sinusoidal of 5.5 c/deg or 16 c/deg) or bars (7.5 or 1.25 minutes wide). As 
expected, test gratings in the vicinity of 5.5 c/deg had elevated thresholds after 
exposure to an adaptation grating of 5.5 c/deg and a similar, localized elevation near 
the adapting frequency following exposure to 16 c/deg. The power spectrum for a 
single bar, in Fourier terms, is quite different from what one expects by simply 
treating the bar as a half-cycle of the appropriate width grating. The spectra are of 
the form (sin 5x/x)? where 6 is one half the bar width. This means that a single 
bar of width 1.25 min will have a fairly uniform distribution of spectral power from 
1 c/deg through 20 c/deg. Most important for the Sullivan et al experiment is the 
fact that both adapting bars have approximately the same power in the spectral 
region where the contrast sensitivity function is at its most sensitive—between 2 and 
4 c/deg. Adapting to a sinusoidal grating of 5.5 c/deg raises the threshold for all 
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test bar widths in a fairly uniform fashion without evidence of selective adaptation. 
This is presumably dependent upon the fact that all the test bars have approximately 
the same power in the region of the adapting spatial frequency, 5.5 c/deg. Corre- 
spondingly, exposure to a grating of 16 c/deg has virtually no effect on the thresh- 
olds for bars of various widths, since all of the test bar spectra have considerable 
power at other, lower frequency regions (e.g. 5 c/deg) upon which we would expect 
the 16 c/deg sinusoidal grating to be ineffective. In the same way exposure to either 
bar width is sufficient to elevate the threshold in a uniform fashion for any test bar 
width. This too is predictable from (a) the spectrum of adapting stimulus (b) the 
spectrum of the test stimulus (c) the contrast sensitivity function and (d) the 
assumption that threshold, for any target whatever, is reached as soon as any one 
of its frequency components exceeds its own threshold power. Note that this final 
assumption is central to the multiple-channel model (Graham & Nachmias 91, 
Sachs, Nachmias & Robson, 159) and is the standard assumption for interpreting 
data from any threshold type experiment (e.g. Stiles’ two-color increment threshold 
experiments, Enoch 70). The nonselective character of bar—bar interactions raises 
questions about the assumption (Pantle & Sekuler 144) that as far as the visual 
system is concerned spatial frequency and width are simply and reciprocally related. 
Certainly, the findings of Sullivan et al are inconsistent with that assumption. 

Carter & Henning (51) gave an elegant translation of some auditory experiments 
into the domain of spatial vision. This exercise was designed to show that “in 
representing visual spatial frequencies the behaviour of the eye can be considered 
as similar to the behaviour of the Helmholtz’s model of the ear” (p. 356). Visibility 
was measured using a forced-choice procedure for two kinds of test targets as a 
function of the contrast of the test targets. One target was 160 cycles of a grating 
of 5.9 c/deg while the other was a single cycle of the same frequency grating. The 
power spectra of these two targets are quite different, with nearly all of the power 
in the multi-cycle grating lying right around 5.9 c/deg while the power of the single 
cycle grating has a more nearly uniform distribution. In the main part of their 
experiment Carter & Henning used veiling visual noise which was either narrow- 
band and limited to the region around 5.9 c/deg or was broad-band noise. The 
narrow-band noise, as expected from the relation of its spectrum to the spectra of 
the two test targets, elevated the threshold only for the full grating (here both veiling 
noise and test target had power restricted to the same portion of the spectrum). 
Again, the Fourier spectra of the targets, along with the assumptions of the multiple- 
channel model permit the prediction of the data. A related experiment was reported 
for suprathreshold visibility (apparent contrast) measurements by Weisstein & 
Bisaha (206). 

Another study, by Stromeyer & Julesz (189), tested the usefulness of the Fourier 
multiple-channel model while reminding us of the growing debt researchers in 
spatial vision owe their brethren in auditory research. In a design much like classic 
masking designs in audition (160), Stromeyer & Julesz measured the masking of 
sinusoidal gratings veiled by filtered, dynamic noise of various spatial frequency 
content. Virtually all of their data converge on one simple proposition: only masking 
noise in the immediate vicinity of the test spatial frequency is effective in elevating 
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the threshold. In one experiment, for example, masking was measured as a function 
of the width of the noise band. This band was kept centered on the test frequency. 
Masking increased with the width of the masking band up to about +1 octave (an 
octave change in spatial frequency is a twofold change). Beyond this critical band, 
further noise had no effect. Stromeyer & Julesz devoted considerable time to com- 
paring the critical bands in audition (frequency) and in vision (spatial frequency). 
Among the striking parallels is the observation that in both modalities there are 
approximately 25-35 difference thresholds within a critical band. 

Before turning to some of the flies in the otherwise soothing ointment, the study 
by Maffei & Fiorentini (124) on Fourier synthesis in vision should be considered. 
Recall that just as any arbitrary luminance distribution can be represented (ana- 
lyzed) in terms of appropriate frequency sine and cosine components, the compo- 
nents can be chosen to construct (synthesize) any arbitrary luminance distribution. 
This reversibility of the Fourier transform was tested by Maffet & Fiorentini in a 
most unusual experiment. Assume that there is a mechanism in the brain, at or 
beyond the point of binocular combination, capable of Fourier synthesis. If we 
present a sinusoidal grating of frequency F to one eye and a sinusoidal! grating of 
frequency 3F to the other and if the phases of the two gratings are correct and their 
relative contrasts appropriate, the Fourier synthesizer will be receiving inputs identi- 
cal to what it would get if we presented a square wave grating just to a single eye. 
Subjects confirm the perceptual similarity of the dichoptically presented Fourier 
components to the monoptically presented square wave grating. Quantitative mea- 
surements were made by asking subjects to adjust the contrast of the dichoptic 
components until a square wave grating was seen. For five different fundamental 
frequencies, a grating appeared to be square wave when the third harmonic ampli- 
tude was 1/3 that of the fundamental. This, of course, is what we’d expect from 
Fourier synthesis. Similarly good agreement was found for the synthesis of a dichop- 
tic rectangular wave grating from sinusoidal components to each eye. It should be 
noted that presenting a high contrast (ten times threshold) fundamental produced 
an unsatisfactory fit to the theory. Low contrast fundamental frequency gratings 
seem to be critical for these experiments. 


Anomalous Observations 


Despite the growing acceptance of the idea that the human visual system may indeed 
be performing a hardware Fourier-like analysis and, at threshold, be detecting the 
power in various spatial frequency bands with independent mechanisms, there are 
several observations that will clearly require changes in the basic model. Many of 
the difficulties fall into two logical classes: one set of observations demonstrates 
interactions between channels postulated to be “independent”; the second set of 
observations is inconsistent with the strong proposition that human spatial vision 
is exclusively the product of Fourier analysis performed by the visual system. Both 
sets of observations suggest the excessive simplicity of the Fourier version of the 
multiple-channel model. The observations which will be described in this section 
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identify how the theories need to be modified; they do not show the theories to be 
without considerable value.‘ 

One difficulty seems to be with the variability in the estimate of channel width 
in the spatial frequency domain. Generally, adaptation experiments suggest broader 
spatial frequency channels than are suggested by threshold or additivity experiments 
(Graham 90). Ultimately these estimates can only be reconciled by a better under- 
standing of what we are doing to the visual system in the various “adaptation” 
paradigms. 

Tolhurst (192) has reported some observations that can be explained most easily 
if we postulate that, above threshold, spatial frequency channels inhibit one another. 
Tolhurst has found, for example, that both the fundamental and third harmonic in 
a compound adapting stimulus are less effective in their threshold raising effects then 
they would be if presented singly. Although this effect is small (about .10 log units) 
it does seem to be genuine, growing systematically with the contrast of the adapting 
stimuli. Tolhurst points out that this inhibition between spatial frequency channels 
would be analogous to the inhibition between orientation channels postulated by 
Blakemore, Carpenter & Georgeson (21) to explain some psychophysical observa- 
tions. While this suggestion of inhibition between orientation analyzers was origi- 
nally disputed (Coltheart 55, 56, Blakemore, Carpenter & Georgeson 22) it has 
received support in the studies by Sekuler & Littlejohn (169), Benevento, Creutzfeldt 
& Kuhnt (14), and Blakemore & Tobin (26). The physiological observations of 
Benevento et al present a model for inhibitory cross-talk between cortical columns 
of cells tuned to differing orientations and direction of motion. This means that the 
concept of a cell’s “trigger feature” (Barlow, Narasimhan & Rosenfeld 10) will have 
to be extended to take into account those features which inhibit a cell as well as those 
which are excitatory. Stecher, Sigel & Lange (180) have described other psycho- 
physical observations which seem to require the concept of inhibition between 
spatial frequency channels. Unlike Tolhurst, Stecher et al found inhibition between 
spatial frequencies usually considered to be within a single spatial frequency band 
(e.g. 8c/deg and 10.7 c/deg). 


"Again it may be useful to consider the parallel situation in audition. Ohm’s Acoustic Law, 
which is essentially the auditory version of what we have been calling the multiple-channel 
Fourier model, is only partially correct. Various phenomena of periodicity, pitch, and masking 
are at variance with Ohm’s Acoustic Law. W. D. Ward considers the law in his chapter on 
music perception in Tobias’ Foundations of Modern Auditory Theory: “For years musicians 
have been told that the ear is able to separate any complex signals into a series of sinusoidal 
signals—that it acts as a Fourier analyzer. This quarter-truth, known as Ohm’s Other Law, 
has served to increase the distrust with which perceptive musicians regard scientists, since it 
is readily apparent to them that the ear acts in this way only under very restricted conditions. 
...” (p. 438). My suspicion is that the multiple-channel Fourier model in vision will rapidly 
approach the status of Ohm’s Other Law—a quarter truth. But, lest this be taken as a cause 
for despair, it should be remembered that a quarter is far greater than no truth at all. We should 
not lose sight of the fact that at least under certain ideal conditions the eye, like the ear, may 
be describable as a multiple-channel Fourier analyzer. 
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Campbell & Howell (46) described an intriguing demonstration of a kind of 
monocular rivalry suitable for classroom or laboratory, which testifies strongly to 
the existence of inhibitory interactions of some kind between channels sensitive to 
(a) different orientations or (b) different spatial frequencies or both. The basic 
phenomenon is a kind of rivalry or perceptual alternation between overlapping 
sinusoidal gratings at right angles to one another. Variation in several of the parame- 
ters led Campbell & Howell to conclude the phenomenon is consistent with the idea 
“that orientations and spatial frequencies are processed in independent channels 
which are highly selective (+ 15° and + 1 octave). At the same time, these channels 
are interdependent, for the alternation must mean they interact” (46, p. 21). A 
closely related effect, alternating monocular rivalry between orthogonal contours in 
an afterimage, was reported by Sindermann & Lüddeke (174). It is certain that both 
these phenomena will receive a good deal of attention in coming years. Finally, 
Pantle (143) demonstrated that translation of a complex grating causes its compo- 
nents to interact at threshold while for a nonmoving grating the same components 
are quite independent. 

One thing seems certain, regardless of the impact these anomalous findings may 
have, any adequate model will have to be able to account not only for independence 
of channels at threshold, but also for the possibility of inhibitory interactions when 
they are suprathreshold. Similar problems are being faced in color vision and it may 
be instructive to researchers in spatial vision to examine the conceptual models 
proposed for such phenomena (Boynton, Das & Gardiner 30). There are precedents, 
in the Limulus retina and in the case of brightness contrast in humans (96), for 
supposing that the spatial interaction between adjacent elements grows as their 
respective response levels grow. This means that elements that show no evidence of 
inhibitory interactions at or near their own thresholds may show strong inhibition 
when stimulation is suprathreshold. 

Greenwood (93) compared the contrast sensitivity functions obtained with two 
kinds of targets—structured, sinusoidal gratings and unstructured, grain images. 
The large difference between sensitivity to the two kinds of stimuli (a factor of 25) 
led Greenwood to suggest that the structure of the target—quite apart from its 
spatial frequency characteristics—controls its detectability. This experiment is 
strong proof that the visual system cannot be totally described as a Fourier analyz- 
ing system. If such a description were appropriate, there would have been no 
difference between the sensitivity to Greenwood’s two kinds of targets, taking their 
spectral characteristics into account. 

Armstrong & Sekuler (4) found Fourier theory inadequate for predicting the 
extent to which a small, 1° square target containing a grating of 3 cycles/deg would 
be masked by two larger gratings of the same frequency. Their enabling assumption 
was that masking would be a function of the similarity of the spectra for test and 
masking gratings. The spectrum of the masking field was manipulated by variation 
in the vertical separation between its upper and lower halves. As the gap between 
the two halves went from zero to nonzero values, the spectrum of the masking target 
lost power at the fundamental and gained it at lower frequencies, corresponding 
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roughly to the width of the gap. The test target remained centered in the field, with 
masking gratings positioned symmetrically about the center. A two-position forced- 
choice procedure was used to measure luminance thresholds. The spectra led to the 
prediction that considerable masking would occur even when the masking and test 
targets were fairly widely separated in space (when the gap between the two halves 
of the masking grating was large). Contrary to the predictions from Fourier theory, 
masking fell virtually to zero as soon as the spatial overlap between masking and 
test targets went to zero. 

One final set of findings raises some difficulties for the Fourier multiple-channel 
model and, indeed represents a great challenge to any explanatory scheme. There 
is a variety of contours which for want of better name we may term “anomalous 
contours.” From the Fourier point of view such contours are anomalous because 
they have no representation in the stimulus’ power spectrum. They can be observed 
under several different kinds of conditions and may in fact not all be ultimately 
amenable to a uniform explanation. When a section of a random dot stereogram is 
actually empty of dots, it may appear to contain contours (57, 106). Similar anoma- 
lous contours are the sharp edges that surround depth planes in various of the 
random dot stereograms—there are regions which seem, for example, to float as 
sharply defined and outlined squares above the background. The anomaly is that the 
edges of the planes are so strikingly contoured, when in actuality such contours and 
sharp edges have no physical basis. Gregory (94) has described related anomalous 
contours in patterns with none of the complexities of random dot stereograms. 
Moreover such anomalous contours seem, according to Gregory, to be able to 
produce the standard geometric illusions, much as real contours can. While I doubt 
the usefulness of the kind of theory that Gregory advances to “explain” his observa- 
tions (he says that the visual system “postulates” the existence of the contours and, 
voila they are there), the phenomena are a challenge. Similar anomalous edges and 
contours can be seen in a computer driven display of spatial random dots. If the dots 
in a central horizontal strip are held stationary while upper and lower sections are 
caused to drift across the screen, the center section will appear to have a sharply 
defined edge which has no physical basis (P. Tynan, personal communication). 

A final kind of anomalous contour is related to the phenomenon in audition of 
periodicity pitch (108, 176). E. Levinson (personal communication) created a com- 
plex grating by amplitude modulating a 5 c/deg sinusoidal carrier with a 1 c/deg 
sinusoid. The resultant complex grating has a spectrum with power at 4, 5 and 6 
c/deg (half the power at 4 and 6 as at 5 c/deg). There was a distinct 1 c/deg 
periodicity in the complex grating. This perceptually obvious component can be 
shown to have no representation in the energy spectrum of the amplitude modulated 
grating. Moreover, the complex grating, used as an adapting stimulus, produced no 
threshold elevation for simple test gratings of 1 c/deg. 

With few exceptions, recent work with various forms of linear systems approach 
has largely ignored the question of the appearance of suprathreshold targets. One 
obvious difficulty is that as we move away from threshold signals, the obvious 
nonlinearity of the visual system intrudes (Westheimer 213). Thus as a sinusoidal 
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grating is raised in contrast it takes on an increasingly nonsinusoidal appearance, 
with the “brighter” bars seeming wider than the “darker” regions. One hypothesis 
to deai with this observation is that some nonlinear transform occurs quite early in 
the visual system (59). 

While much research on the detection of simple and compound gratings is consis- 
tent with a multiple-channel model of the kind outlined by Sachs, Nachmias & 
Robson (159), such a model requires added psychophysical linking hypotheses to 
predict how things will look. Westheimer (213) reminds us that these threshold 
experiments only tell us when a grating will be seen as different from a uniform field 
but not how it wiil appear. If one postulates that the appearance of the world is even 
in part produced by a synthesis of information in various spatial frequency channels 
(Maffei & Fiorentini 124), there ought to be a whole set of unexplored perceptual 
phenomena which are produced by shifting balances among the responses of various 
spatial frequency channels. Among the few explorations of this possibility is the 
study by Blakemore, Nachmias & Sutton (25) of the shift in apparent spatial 
frequency of a suprathreshold grating as a function of adaptation to various frequen- 
cies. But how, for example, does the apparent frequency of a grating vary with 
exposure duration? Several studies have shown that the sensitivity of various spatial 
frequency channels is differentially affected by short exposures (137, 138). If the 
balance of activity in various spatial frequency channels shifts with exposure dura- 
tion, we might expect an illusion of altered apparent spatial frequency with short 
exposures of suprathreshold gratings (cf 194a). 


Feature Detectors 


In parallel with the advances in the development of multiple-channel models for the 
analysis of spatial frequency, there has been an accelerated growth in studies which 
purport to demonstrate the existence of feature detecting mechanisms by which “the 
brain decomposes the physical image of the external world into a great multitude 
of anatomic channels each of which is processed independently before being ulti- 
mately synthesized into a subjective global impression” (88, p. 2). If this approach 
is ultimately to fulfill its promise, several problems will have to be resolved. First, 
we will have to consider what the relationship of these mechanisms is to other 
dimensions of spatial visual sensitivity, including spatial frequency. Does, for exam- 
ple, the analysis of spatial frequency information proceed in parallel with that of 
orientation? If not, can we make supported statements about the sequence in which 
features are analyzed? Two studies related to this question will suffice here. Camp- 
bell & Maffei (48) claimed to show that the tilt after-effect was not related to the 
spatial frequency of the adaptation and test targets. A similar finding has been 
reported by Parker (146). If these claims are taken at face value, the tilt induced 
on a test grating is independent of the spatial frequency relationship between it and 
the spatial frequency of the adapting grating. Campbell & Maffei used this datum 
to support the idea that “the mechanism responsible for orientational selectivity 
occurs before the processes leading to spatial selectivity” (48, p. 838). These theoreti- 
cally significant findings require a more secure experimental basis since (2) they are 
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inconsistent with other work in the literature, notably on the McCollough effect, 
(b) the data themselves are not very convincing, and (c) at least one investigator 
(M. Sullivan personal communication) did find evidence of spatial selectivity in the 
tilt after-effect using forced-choice procedures. 

Tolhurst & Hart (194) found that thresholds for moving contours were elevated 
equally by prior retinal image movement whether it was produced by (2) eye 
movement with a stationary field, or by (6) a moving field with no tracking eye 
movements. This suggests that the site of direction specific adaptation in the human 
visual system is prior to point of operation of the corollary discharge which dis- 
counts eye movements. Other studies of direction specific adaptation have been 
reported by May (131), Erke & Gräser (73), Breitmeyer (33), Luria & Ryan (120), 
and Beck & Stevens (13). Related to the issue of the corollary dischage (Skavenski 
175) are reports of cells in cat visual cortex which respond differentially to uniform 
and relative motion in the field (Burns, Gassanov & Webb 42, Bridgeman 35). Such 
cells would facilitate the discrimination of retinal image motion which was due to 
motion of objects from retinal image motion components produced by eye-move- 
ments. The implications of the analysis of relative motion for the perception of 
movement are discussed elsewhere (167). 

There is an often ignored danger inherent in the assumptions behind experiments 
which try to identify the relationship between the neural sites at which various 
information is extracted and used (Tolhurst & Hart, 194), i.e., it is possible that any 
phenomenon can be generated in more than one site in the visual system. Julesz 
(106) has argued, for example, that there are several sites at which information about 
visual motion is extracted. This possibility needs to be taken into account in all 
studies where an attempt is made to identify the locus of some phenomenon of 
spatial vision or to identify the relationship between the loci for two different 
phenomena. 

The family of effects related to the McCollough effect (121) continued to grow 
at an alarming rate during this review period, with a number of new observations 
(34, 76, 117, 131, 132, 135, 136, 188). While work continues on these challenging 
long-lived phenomena, there seems to be a growing sophistication in the uses to 
which after-images and stabilized retinal images are put in the service of the exami- 
nation of feature analyzers. The papers by D. Brown and his associates (39, 40, 60, 
164) and those of Atkinson (5, 6) testify to this point. The experiments by Atkinson 
suggest that after-images may bear stronger functional resemblance to the character- 
istics of their “real” predecessors than had hitherto been suspected. Not only does 
Atkinson show that after-images of line exhibit the same meridional anistropies that 
real lines do (see also 201), but she describes rather complex inhibitory interactions 
between pairs of lines in an afterimage. Such interactions are reminiscent of Parlee’s 
(147) measurements of masking of one line by another. Because of Atkinson’s 
observations and those on alternating monocular rivalry in after-images (174) one 
wonders about the extent to which after-image of contours, for example, behaves 
like “real” contours in a variety of paradigms. Perhaps good use can be made of the 
concept “equivalent contour,” in analogy with “equivalent background” for spa- 
tially uniform fields (12). 
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Finally, Bagrash (8) and Kerr & Thomas (110) extended a non-Fourier multiple- 
channel model measuring selective adaptation with radially symmetric stimuli 
which cannot be dismissed as candidates for visual system eigenfunctions. There 
were two new studies of feature detectors from the Cambridge Laboratory. Tolhurst 
(193) gave us the first good evidence of edge detectors (as opposed to bar or grating 
detectors or what have you) and proposed a new, tenative explanation of both Mach 
bands and the Craik-Cornsweet-O’Brien illusion. See also references 153, 161, 171, 
and 198 for additional discussion of this illusion. Sharpe (171) used temporal modu- 
lation to make visible the shadows of the retinal blood vessels. He reports that the 
shadows, being essentially stabilized retinal images, fade and require increasingly 
higher contrast over time to remain visible. Most relevant to the issue of feature 
detectors is the finding that the fading which transfers binocularly and may have 
a central component, exhibits remarkable specificity for orientation and direction 
of movement. Having mentioned interocular transfer, I would be remiss if I did not 
cite Andrews’ (2) brief and provocative observation about the possible information 
to be gained from such measurements. These observations are well complemented 
by those of Julesz (106) on the same subject. 


OBSERVATIONS WITH CLINICAL IMPACT 


As indicated in the preamble to this review, one of the trends which has grown 
during the review period is the cooperation between clinicians and nonclinicians 
with a mutual interest in spatial vision. As evidence for this claim, consider the 
following sample of observations. 

The spatial contrast sensitivity function, discussed earlier, is both a descriptive 
and normative tool and has proven important in several clinical contexts recently. 
Bodis-Wollner (28) found changes in the contrast sensitivity functions of patients 
with cerebral lesions. While most patients showed a relatively greater loss at the high 
frequency end of the function, at least one showed a deficit in the midfrequency 
portion of the range (2-10 c/deg). While one must exercise great care in interpreting 
these results, several things are clear. First, it is important, as Bodis-Wollner re- 
minds us forcefully, to distinguish between acuity, as normally defined, and the 
information represented in the contrast sensitivity function, since acuity measures 
the resolvability of small details presented at very high (1.0) contrast. One could 
argue that the contrast sensitivity function is a more ecologically valid measure of 
ability to resolve details, since only rarely are we presented with 1.0 contrast objects 
in the real world. Second, Bodis-Wollner makes the following fascinating sugges- 
tion: “Assuming that the brain does use a harmonic analysis for pattern recognition, 
then in suprathreshold stimulus conditions a proper balance of independent signals 
is required to account for an invariant neural representation of the retinal image. 
In this case a nonuniformly altered contrast sensitivity would pose a greater diff- 
culty in pattern recognition than a simple drop in visual acuity” (28, p. 770). One 
can see in Bodis-Wollner’s data the recovery of normal contrast sensitivity over the 
period of six weeks testing, paralleling the patient’s own recovery. Data on one of 
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his patients, showing a depression mid-range in the contrast sensitivity function, 
suggests a loss in one channel of a multiple-channel system. If we did, in fact, have 
good solid normative data against which to judge a given subject’s function, Bodis- 
Wollner’s idea of plotting a “visuogram”—the ratio, at each spatial frequency, of 
normal contrast sensitivity to that of a given patient—would be a good one. But 
clearly the normative data are needed first. 

Another clinical use of the contrast sensitivity function was reported by Green 
(92). Taking advantage of laser produced interference fringes to bypass the optics 
of the eye, Green was able to estimate the likely postoperative acuity of cataract 
patients. Such information, of course, provides an important guide to the likely value 
of the cataract removal. 

Mitchell, Freeman, Millodot & Haegerstrom (81, 133) used various sorts of 
grating targets to measure the neural consequences of early selective contour depri- 
vation in human vision. There is a growing literature on the visual effects of rearing 
neonatal animals under various conditions of stimulus deprivation (Blakemore & 
Cooper 23, Hirsch 98, Spinelli, Hirsch, Phelps & Metzler 178, Ganz, Hirsch, & 
Tieman 83) or rearrangement (Shaler 172). In a brief summary statement on this 
literature, Barlow & Pettigrew (11) offered the provocative comment that in kittens 
only two stimulus analyzing functions emerge without requiring appropriate envi- 
ronmental inputs: functions sensitive to direction of motion through the field and 
a kind of primitive sensitivity to binocular disparity. 

Kittens reared under normal, nondeprived conditions have simple cortical cells 
“tuned” to contours of various orientations, with orientation preferences distributed 
uniformly about polar coordinates. Hirsch & Spinelli (99) and Blakemore & Cooper 
(23) showed that if kittens were reared for several months, from birth onward, in 
a manner that kept them from seeing contours of a particular orientation (e.g. 
horizontal) there would be a paucity of simple cells tuned at or near that deprived 
orientation. Moreover, the kittens would appear to be behaviorally blind to contours 
oriented at or near the deprived orientation. Mitchel] et al reasoned that similar 
neural alterations might occur in humans whose visual systems were selectively 
deprived of exposure to certain stimulus orientations. In astigmatism there is an 
inequality in sharpness with which contours of different orientations are imaged. 
Astigmatic subjects might then show the neural consequences of this partial and 
selective contour deprivation. Using laser produced interference fringes on the retina 
as well as conventional gratings, Mitchell et al found clear evidence for a neural (as 
opposed to optical) meridional ambiyopia—loss of acuity for certain stimulus orien- 
tations. Moreover, the degree of meridional amblyopia was strongly related to the 
degree of astigmatism. This work implies that unless astigmatism is detected and 
corrected very early in life it may very well produce lasting changes in the develop- 
ment and organization of the visual nervous system. Mitchell et al offered some 
speculation on the relevance of their results to possible cross-cultural differences in 
perception. It is well-known that a variety of visual functions exhibit what might 
be termed the Meridional Effect (Appelle 3). Acuity, detection and other visual 
functions seem to be better when the targets are oriented at vertical and horizontal, 
and poorest at orientations between (obliques). Mitchell et al proposed that such 
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adult deficits at oblique orientations might arise from the relative paucity of oblique 
contours in the environments of children raised in the highly technological societies 
filled with man-made horizontal and vertical contours. 

The speculations of Mitchell and his associates are made even more intriguing by 
the recent demonstration by Fiorentini, Ghez & Maffei (78, 79) that the Meridional 
Effect depend in adults upon experience. Fiorentini et al had human subjects wear 
Dove prisms, which rotated the visual field by 30° or 45°, The prisms were worn 
for periods of 4 to 7 days, during which time the subjects experienced a decrease 
in apparent tilt (i.e. adaptation to the sensory rearrangement). In rough parallel with 
this phenomenal adaptation there were changes in two measures of the Meridional 
Effect. First, there was a decrease in the usual difference between contrast threshold 
for a vertical grating and the threshold for an oblique grating. Second, there was 
a decrease in the difference in the averaged evoked potential produced by an oscillat- 
ing vertical grating and an oscillating oblique grating. Fiorentini et al observed “The 
findings suggest that during prisms adaptation plastic changes occur in the postreti- 
nal mechanisms responsive to target orientation.” (78, p. 545). 

The relationship of Mitchell et al’s findings to certain common clinical problems 
is worth noting. In particular, amblyopia ex anopsia (amblyopia of disuse) may 
occur when, in early life, one or both eyes is deprived of normal visual stimulation 
by corneal opacities, congenital cataracts, or other obstacles (von Noorden 200). 
Clearly, amblyopia of all forms, including strabismic amblyopia, is a disorder that 
researchers in spatial vision are going to have important input to. Since amblyopia 
is estimated to effect some seven million Americans (32) such research efforts ought 
to be strongly encouraged. 

In order to help identify the neural locus of functional amblyopia in one of their 
patients, Lawwill and his colleagues (112) used the Westheimer effect (211) to 
compare receptive field sizes in the amblyopic and normal eyes. With the visual 
evoked potential as a dependent measure, Lawwill et al found the critical sizes of 
the surround, in the Westheimer paradigm, to be substantially enlarged in the 
amblyopic eye. This led them to conclude that the locus of the functional amblyopia 
is peripheral to the cortex, the site of the evoked response. 

Work is continuing on various attempts to use the visual evoked potential as the 
basis for an objective, automatic, and sensitive refraction (Ludlam & Meyers 118, 
Dawson et al 63). Finally, Poppel, Held & Frost (149) reported some rather remark- 
able observations on residual visual function in areas of the field which are quite 
blind by standard definitions. Although the residual function is exceedingly slight, 
its existence is of considerable importance for theories which implicate midbrain 
structures in vision. Moreover, and most relevant here, the unusual stimuli required 
to demonstrate the effect remind us of the importance of perimetry with moving 
targets (7, 74). 


METHODOLOGICAL ISSUES 


Regardless of the brilliance of its insights, a research area can only be as good as 
the tools it uses. In spatial vision, most experiments depend strongly on the integrity 
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and replicability of earlier findings. Several of the experiments discussed here require 
the observer to make judgments that can only be described as “difficult.” Especially 
when dealing with small effects (e.g. 0.3 log units or less), it is important to establish 
that criterion shifts of one kind or another are not distorting the results. It is ironic 
that while the field of spatial vision has grown more sophisticated in the techniques 
of stimulus generation and in theory-building, there has not been a corresponding 
increase in the sophistication of psychophysical procedures. While all data do not 
have to be collected using forced-choice procedures, in experiments requiring diffi- 
cult judgments it would be useful to check some of them using more objective 
procedures. Consider but two examples of this point. Weitzman, Smith & Karasik 
(207) using a signal detection analysis, found that at least a part of the Meridional 
Effect may be due to criterion rather than pure sensitivity differences. Pollack (150) 
found that “subjectively” subthreshold stimuli were quite detectable if forced-choice 
testing was used. 

One advantage of a forced-choice procedure is that the observer is presented with 
a null stimulus on each trial, aiding him to discriminate the presence from the 
absence of the target whose threshold value is being sought. Particularly in experi- 
ments where it may be difficult for the observer to discriminate the test target from 
some after-image of the adaptation stimulus, a clearly defined null stimulus would 
be invaluable. As a kind of compromise between the time consuming psychophysics 
of signal detection theory (Nachmias 139) and the very quick, but very dirty, 
psychophysical method of adjustment, experimenters might consider the use of an 
“implicit null stimulus.” This would entail restricting the test target to a previously 
specified portion of the field, leaving the rest of the field uniform at the same space 
average luminance. The ability to compare the top and bottom of the field with the 
knowledge that the test target will only appear in the top half, would make even 
difficult judgments easier, Le, less absolute. The cost of this procedure is the possibil- 
ity that edges will complicate the spectrum and affect the visibility of the test target 
(43). 

The evidence of interaction between spatial and temporal aspects of visual stimu- 
lation suggests that theories of spatial vision cannot completely ignore the temporal 
characteristics of that stimulation. Some of the interactions with temporal variables 
may, in fact, provide good insight into the nature of the spatial response itself (e.g. 
137, 138, Watanabe et al 203). If, to take a conservative position, there is any 
interaction whatever between the spatial and temporal selectivities of various mech- 
anisms in the human visual system, care must be taken in generalizing from experi- 
ments on spatial vision in which only a single value of temporal stimulation has been 
used. The pattern of spatial responses to stimuli that are temporal impulses may be 
quite different from those to stimuli with extended step functions. These possibilities 
need to be investigated. A paradigm that is of potential use in such investigations 
has been introduced by Matin, Matin, Hirsch & Kornheiser (129). Their adaptation 
of Crawford’s (61) rapid dark and light adaptation procedure measured the apparent 
tilt of a line presented for 10 msec at various temporal positions before, during and 
after an inducing line of 3 sec duration. Large transient effects at onset and offset 
of the inducing stimulus were strikingly similar to those which Crawford reported 
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for early light and dark adaptation. Related to this paradigm is the fact that we must 
always be alert to the possible effects of the temporal rise and fall rates of spatially 
defined stimuli, particularly when their total duration is short. At the very least, 
information about this variable ought to be reported if not properly controlled (e.g. 
Carter & Henning 51). 

Selective adaptation has often been used to “isolate” mechanisms sensitive to one 
or another aspect of stimulation. In this work, temporal variables have largely been 
ignored. As we shall see, this fact seriously jeopardizes the value of the data so far 
accumulated. Orientational selectivity, attributed by most workers to the charac- 
terisitcs of orientation analyzers like those described by Hubel & Wiesel, has been 
studied with three different temporal conditions: test target preceding the adaptation 
field, test target following the adaptation field, and the two targets overlapping 
temporally. Kulikowski, Abadi & King-Smith (111) showed convincingly that these 
temporal conditions do not yield equivalent estimates of orientational selectivity, 
though the first procedure just described, in which test precedes adaptation, was not 
examined. Additional complications are introduced into this messy picture when we 
consider the implications of the observations of Matin et al (discussed above). Those 
observations suggest that not only might the three different temporal arrangements 
give different data on orientational selectivity, but, moreover, temporal variation 
within any of the three procedures might give still other answers. For example, 
would selectivity vary, using temporally overlapping test and adaptation fields, with 
their onset asynchrony? The observations by Matin et al and related data on orienta- 
tion selectivity in backward masking (166) make one suspect that it would. All of 
this has rather disquieting implications not only for the invariance of psychophysical 
estimates of orientation selectivity, but also for the invariance of other measures as 
well. For example, temporal variables have largely been ignored in experiments 
which measure the width of spatial frequency channels. Part of the discrepancy 
among estimates of channel width may in fact be related to just such differences 
between the temporal aspects of the relevant experiments. 

So far the methodological suggestions have been largely directed toward thresh- 
old measurements. What about measurements of suprathreshold phenomena? There 
seems to be an increasing willingness to use the kind of dependent measure ad- 
vocated so forcefully by S. S. Stevens. Stevens, who passed away during this review 
period, argued for the use of direct scaling procedures, magnitude estimation among 
others. While there is nothing logically “wrong” with these procedures, one would 
hope that in the future such dependent measures will be calibrated at the very least. 
For example, numerical estimates of the apparent contrast of constant contrast 
targets (206) should be calibrated with similar estimates made to targets of various 
contrast. Such calibration data would very much enhance the interpretability of the 
subject’s verbal responses. Working on a somewhat different problem, Sindermann 
& Liiddeke (174) did provide the kind of calibration advocated here. 

In two papers published posthumously (198, 199), von Békésy described the use 
of a novel procedure by which suprathreshold spatial effects such as Mach bands 
could be measured objectively. In the von Békésy compensation procedure a subject 
is given control over the spatial distribution of luminance and adjusts it until the 
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Mach band is just cancelled. The luminance required to compensate for the Mach 
band measures the band’s strength. This procedure is one that lends itself to creative 
implementation with online computer driven cathode ray displays. 

The brief, but important, note of Williams & Williams (217) should remind us 
to make sure subjects do not hum while observing. Humming while watching 
television produces horizontal lines on the screen whose frequency and other charac- 
teristics are a function of the hum’s qualities. Rushton had earlier given a theoretical 
basis for these observations. While humming might improve the interest of many 
television programs, it might also wreak havoc with data from experiments using 
cathode ray displays of various kinds. 

Finally, useful developments in areas just peripheral to spatial vision ought to be 
briefly mentioned. Barlow, Narasimhan & Rosenfeld (10) have argued convincingly 
for strong cooperation among psychologists, physiologists, and computer scientists 
in a common attack on the problems of spatial vision and pattern recognition. This 
sort of interaction has already produced some computer programs constructed to 
mimic the hierarchical feature analyzing operations described by some physiologists 
(Wallingford 202, Fukushima 82). While the present state of the art in all three 
disciplines—psychology, physiology, and computer science—limits the real infor- 
mativeness of hardware models, there is hope that computer models will develop 
to the point where the idiosyncracies of their operation suggest new insight into 
spatial vision just as the idiosyncracies of psychophysical and physiological observa- 
tions now do. 
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This review focuses on single-unit recording data from mammalian first-order 
afferent fibers and from cerebral cortex and, where possible, on the relationships 
between these data and psychophysical data. It is largely confined to research 
reports appearing in archival literature from 1965 through 1972, except that only 
the latter half of the period is included for the vestibular senses, because they were 
the exclusive topic of the preceding review of somesthesis. 

Every reviewer for the Annual Reviews must become a firm believer in the 
statement that most of the scientists who ever lived are alive and publishing today. 
It is a source of real regret that it is practically possible to review only selected topics 
within a research area, and that it is not possible to discuss every paper that may 
have some bearing on a topic. 


TACTILE SENSITIVITY 


Recording single nerve fiber activity in awake human subjects while simultaneously 
making psychophysical measurements and obtaining the subjects’ verbal reports 
would be a powerful research tool for the correlation of electrophysiological and 
psychological data, provided that man had been considerate enough to develop only 
one homogeneous class of receptors and afferent nerve fibers. Then, even though the 
electrophysiological data were obtained from only one or two nerve fibers at a time, 
while the subject’s responses were based on the whole set of fibers that were activated 
by the stimulus, the investigator could proceed with his correlations knowing that 
the fibers from which he recorded reflected accurately the behavior of the whole set 
of fibers. Inasmuch as there are several classes of mechanoreceptors in human skin, 
only when stimuli are restricted in locus and limited to threshold values can one 
sometimes be reasonably sure of the correlation between the subject’s verbal report 
and the unitary nerve-fiber activity picked up by the investigator’s microelectrode. 
At suprathreshold levels, although subjective reports are likely to reflect the activity 
of more than one class of receptor, it is still possible to examine the single-unit 
responses to various kinds of stimulation and to relate these to comparable data 
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obtained from other species as a check on the validity of extrapolating the data from 
the other species to humans. 

Data of this sort are becoming available. It is possible to obtain single-unit 
recordings from the large nerve fibers in the median nerve of awake man (73), but 
it is difficult to avoid injury to the nerve fiber (72). “Spontaneous” activity in these 
large-fiber mechanoreceptors was not accompanied by sensation. “Rapidly adapt- 
ing” and “slowly adapting” receptors were found in the hand, as one would expect 
from the data on monkey (43, 44, 58, 68), with the receptive field sizes for both being 
about half the diameter of a finger tip (22). A stimulus just strong enough to elicit 
one or two action potentials in the fiber from a rapidly adapting receptor is at 
threshold for a verbal report of sensation; this report extends the finding by Hensel 
& Boman (27) some years ago that a single action potential from a human hair 
receptor sufficed to elicit sensation. Many of the slowly adapting units were less 
sensitive than the rapidly adapting ones, responding only to indentation sufficient 
to yield a steady pressure sensation. In a survey of over 60 mechanoreceptive units 
in human glabrous skin (40), slowly adapting units were three times more numerous 
than were rapidly adapting units. The authors recognized that their technique was 
biased for large fibers, but felt that humans have a relatively higher proportion of 
slowly adapting units than monkeys do. 

An alternative approach to the correlation of electrophysiological data and hu- 
man psychophysical data is to obtain the electrophysiological data from conven- 
tional preparations in another species, being careful to use the same experimental 
paradigms, apparatus, and procedures that are used in obtaining the human psycho- 
physical data. A survey of the afferent populations involved enables one to recon- 
struct what the response of the population must be to the same stimulus parameters 
that were employed in the human psychophysical study. One can then test the 
assumption that a similar population of afferents was involved in the human re- 
sponse by seeing how well the human response can be predicted from the electro- 
physiological data. 

This has been done for a precisely controlled vibrating stimulus applied to the 
hand (68). Single-unit recordings were obtained from the median nerve of macaque, 
and thresholds and magnitude estimates were obtained from human observers. 
Single-unit recordings were obtained from quickly adapting mechanoreceptors, 
slowly adapting receptors, and Pacinian receptors. Their threshold and suprathresh- 
old responses were measured in response to a number of variations in the vibratory 
stimulus parameters. The human response to the same stimuli was obtained. One 
aspect of the human response was that the sensations resulting from stimuli in the 
frequency range 5-40 Hz differed from those resulting from frequencies above 60-80 
Hz. The sensation corresponding to the lower frequency stimulus was one of flutter 
or oscillation, while the sensation corresponding to the higher frequency stimulus 
was one of buzz or hum. There were thus two categories of sensation to be accounted 
for, and three categories of mechanoreceptor that might have been involved. Be- 
cause of the similarity in the experimental paradigms and stimulus parameters that 
had been involved m both the electrophysiological and psychophysical data, it was 
possible to conclude that the slowly adapting mechanoreceptors had “no essential 
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role” in the sensations of flutter or of vibration, because the receptors disagreed in 
their behavior with the psychophysical findings; on the basis of the agreement 
between electrophysiological and psychophysical data it was reasonable to presume 
that the quickly adapting receptors were involved in the sensation of flutter, and the 
Pacinian receptors in the sensation of vibration. Several other correlations between 
the neural and psychophysical data were shown as well. In subsequent experiments 
(53), these data were confirmed and were extended by showing parallelism between 
monkey and human psychophysical data. 

An analogous set of experiments was carried out to extend the findings to recep- 
tors of hairy skin (52). Here the flutter sensation was found to depend upon activity 
in rapidly adapting alpha fibers that end in or about hair follicles, while the high 
frequency vibration sensation was again signalled by Pacinian afferents. In this case 
there were three other classes of mechanoreceptors that might have been involved 
in carrying information about stimulus periodicity, but as was the case in glabrous 
skin, it could be shown that their threshold and suprathreshold behavior in response 
to variation of stimulus parameters did not match the psychophysical data. It is 
worth noting that in all cases the different classes of mechanoreceptive afferents were 
in fact carrying information about the periodicity of the stimulus, as evidenced by 
the time locking of their discharges to the cycles of the stimulus. The question of 
what use, if any, the nervous system makes of this information is like the question 
that will come up in a later section of this review: what use, if any, does the nervous 
system make of the temperature information that is present in the output of some 
populations of temperature-responsive mechanoreceptors? It is therefore worth not- 
ing here that the question of how the stimulus-periodicity information is used by 
the several classes of receptors in the three studies just cited was answered, not by 
comparing the electrophysiological data from the various classes of mechanorecep- 
tors with respect to their dynamic range, differential sensitivity, or degree of time 
locking, in an effort to decide which were the most versatile and faithful reporters 
of vibratory stimuli, but was answered by directly comparing the human psycho- 
physical data with what one might call the electrophysiological psychophysical data. 

In the period under review there have appeared a number of experiments on 
single-unit first-order afferent fibers in which careful attention has been paid to the 
control and variation of stimulus parameters, although comparisons with human 
psychophysical data have been limited largely to published reports from other 
laboratories. Probably the most extensively studied class of receptor during the 
period was the Pincus Haarscheibe, or tactile pad, which now is commonly called 
the Type I receptor. Its characteristics have been studied in many creatures, includ- 
ing sheep, mice, rats, rabbits, guinea pigs, opossums, cats, monkeys, and humans 
(45, 46, 49, 50, 63, 64, 66, 69, 71). Together with studies cited in the survey of 
somesthesis in the 1966 Annual Review of Psychology, these studies have docu- 
mented in detail the microscopic and ultramicroscopic structure of the receptor, the 
quantitative details of its displacement sensitivity and displacement-rate sensitivity, 
as well as its response to various frequencies of sinusoidal stimulation, and the way 
in which the discharges from several such receptors are integrated as they converge 
upon the stern axon. Unfortunately, the literature is contradictory on whether these 


236 HAHN 


receptors have anything to do with sensation. On the one hand, Tapper (70) has 
reported establishing in cat a conditioned avoidance to weak tactile stimuli to Type 
I receptors. On the other hand, Harrington & Merzenich (25) did not obtain positive 
reports of sensation in response to stimulation of Type I receptors in human subjects. 
Some investigators believe that they have experienced sensations resulting from 
stimuli confined to their own Type I receptors; other investigators are convinced 
that stimulation of their Type I receptors produces no sensation. Part of the problem 
may be that Type I receptors are often not as easy to identify unambiguously in 
humans as they are in some of our favorite laboratory animals. Smith (64), in eight 
samples of human abdominal skin, found Type I receptors easy to identify in two 
samples, hard to identify in four samples, and impossible to identify in two samples; 
moreover, some presumed Type I receptors on sectioning turned out to be skin, not 
Type I receptors at all. Perhaps more convincing evidence will become available 
from experiments in which Type I receptors in humans are identified on the basis 
of electrophysiological characteristics as well as visual ones. 

Several other functional types of mechanoreceptor have been characterized dur- 
ing the period under review. Two types of phasic receptor in cat’s foot (47), and a 
slowly adapting receptor in glabrous skin of raccoon (54) have been described. The 
Type II receptor of hairy skin (11) has small punctiform receptive fields easily 
excited by stretching the skin as well as by indenting the skin, a low discharge rate 
in the absence of an experimenter-imposed stimulation, and a discharge rate with 
fairly uniform interspike intervals in response to steady stimulation. Three types of 
hair receptor have been described in cat and rabbit (6). They are all rapidly adapting 
receptors, sensitive to velocity of displacement. Tylotrich hair receptors (50) and 
down hair receptors respond with a stream of discharges during movement of the 
hair; the guard hair receptors respond with not more than three or four impulses. 
Another laboratory confirmed these classifications (9, 59), but divided the guard 
hair receptors into two types—G, with higher velocity thresholds, and G, with 
lower velocity thresholds—and in addition announced the existence of yet another 
receptor type in hairy skin, the field receptor, with rather large, stretch-sensitive 
receptive fields not associated with hairs. In fact, the investigators suggested that 
there might be two functional types of field receptor, but felt that their data did not 
support this hypothesis with sufficient strength to make that assertion. 

Response characteristics have also been examined for the receptors associated 
with the carpal hairs on the leg of cat (56, 57) and for the receptors of the mystacial 
vibrissae of rat (80) and cat (21, 23, 55). Each vibrissa is innervated by several nerve 
fibers that serve only that vibrissa, and each is sensitive only to certain directions 
of vibrissa movement. The inference is made that the vibrissae do not serve merely 
as obstacle detectors, but are well suited to serving as active “feelers” of the animal’s 
oral environment. The number of functional classifications of receptor types serving 
the vibrissae has been hypothesized to be at least two (23), four (21), or four plus 
a miscellaneous category (80). 

As the number of functional receptor classifications grows, one cannot help 
wondering to what extent history is repeating itself. A similar proliferation of 
groupings took place in the cytoarchitectonic classification of cerebral cortical areas, 
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and in the anatomical classification of sensory nerve endings in the skin, before it 
was accepted that not every observable difference among cells meant that they 
belonged to different populations. Perhaps these doubts arise only because it is hard 
to imagine why the cat needs in his hairy skin half a dozen different types of 
mechanoreceptors, including a separate type for each type of hair, in addition to the 
somewhat more deeply lying Pacinian type and to the low-threshold mechanorecep- 
tors served by C-fibers. More likely the problem is that there is still much to be 
learned before we know which of the observed response differences are important. 
For example, if it were to be found that a single anatomical type of ending had two 
modes of termination in the tissues, in one of which its thresholds were lower than 
in the other, should it be said that there are two types of receptors? There are 
problems, of course, in deciding what defines a single ending (36, 48). The variety 
of anatomical classification criteria, the variety of names that have been attached 
to what may be a single ending, and the fact that we need to know more about 
variability within and between receptors in different tissues and in different species, 
all mean that the correlation of structure with function is complicated by uncertain- 
ties on both sides. 


THERMAL SENSITIVITY 


Warm 


Temperature receptors continue to be slow to yield their secrets, but progress is 
being made. Recordings from multifiber preparations do show increasing activity 
with skin warming (51, 65), and single-unit recordings have been obtained from 
receptors that are specifically sensitive to warming and that show little or no 
responsiveness to mechanical stimuli. Data have been obtained from such units in 
monkey (26, 29) and in the nasal region of cat (28, 30). The axons appear to be in 
the C- and small A-fiber range. They have regular discharges that increase in rate 
with increases in the temperature of the stimulus above the temperature of the skin; 
to a sudden temperature rise they may show a dynamic response of as high as 200 
impulses per second, but the rates signaling steady temperatures are much lower, 
ranging about 10 to 20 impulses per second. There is some suggestion, from a sample 
of eight fibers, that two populations of response characteristics may exist (29), 
although a-much larger sample will be required to verify this. 

Relating the electrophysiological findings to the behavioral findings regarding 
warmth sensitivity continues to be difficult. Based on avoidance conditioning data 
in the cat (38), the inference has been made that cats cannot feel warm stimulation 
on the back or thigh unless the warmth reaches noxious intensities. Nevertheless, 
new findings (4, 51, 65) continue to add support to the earlier literature showing 
the existence in these areas of receptors sensitive to only moderate degrees of 
warming. On the other hand, behavioral measurements (5) of the sensitivity of the 
cat’s upper lip to warming show a sensitivity here to as little as 1°C of warming, 
although the electrophysiological data from infraorbital nerve serving the same area 
(37) show no neural acitivty resulting from this warming, but only a diminution of 
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the already ongoing activity. The investigators raised the old question of whether 
such diminution of activity could itself be the warmth signal. Our present state of 
knowledge seems to be this: In some body loci there are skin receptors that signal 
the presence of a warm stimulus, but this information appears to be without behav- 
ioral consequence to the animal; there are other body loci in which warm stimuli 
are of behavioral consequence to the animal, but in which specific warmth receptors 
have been sought and not found. 


Cool 


Cool receptors are not as hard to find as warm receptors are. As more data accumu- 
late on cool receptors in different body areas (2, 31, 32, 39) it appears that cool 
receptors have in common an inverted U-shaped function relating both their static 
and dynamic firing rates to skin temperature. The maximum firing rate is typically 
found in the range 21°C-34°C, with an average around 27° or 28°. Some units are 
still firing at 10°C or 40°C. Maximum firing rates at static temperature levels are 
low, in the vicinity of 5-15 impulses per second, although firing rates rise to over 
100 per sec for more rapid temperature changes. 

The inverted U-shaped function relating firing rate to temperature means that it 
is often ambiguous which temperature is being signaled by a given firing rate. One 
of the mechanisms that has been proposed for resolving this ambiguity will be 
described later in this review, in the discussion of receptors that respond both to 
mechanical and to thermal stimulation. Another proposed mechanism for dealing 
with the ambiguity (33, 34) makes use of the following observation: As the tempera- 
ture applied to cold receptors drops, the initially uniform firing rate gives way to 
a “bursting” pattern of activity in the region of 33°C; as the temperature further 
decreases to the general vicinity of 24°C, the impulses tend to cluster into groups 
of two up to six action potentials. Receptor temperature thus could be encoded 
either in terms of the number of impulses per burst, or in terms of the mean 
inter-spike interval within each burst, because both of these parameters are 
monotonically related to temperature within this roughly 10° range. (Below the 
lower temperature the bursting pattern turns into an irregular train of impulses.) 
Clearly the temperature information is there in either one of these parameters of the 
spike train; it remains to be ascertained whether it is used in this form. 


Mechanical Plus Thermal Sensitivity 


Several types of mechanoreceptors are also responsive to cooling. Action potentials 
are initiated at the onset of cooling, and if the unit is already discharging to a 
mechanical stimulus, cooling will increase the steady discharge rate. During the 
period under review, two interpretations of these well-established findings have been 
discussed. One view (35), and probably the prevailing one, is that the temperature 
effect is merely “noise” in the system, and has nothing to do with temperature 
sensation. The fact that these mechanoreceptors are notably less sensitive to temper- 
ature change than are true temperature receptors, and the fact that these me- 
chanoreceptors can be made to produce much higher discharge rates to mechanical 
than to thermal stimuli, are offered as evidence to support the inference that these 
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receptors play no part in temperature sensation. Since the firing rate to a given 
mechanical stimulus is increased when that stimulus is cooled, one would predict 
that a coin resting on one’s forehead would feel heavier if it were cool than it would 
feel at skin temperature; almost a century ago Weber shows this to be true (75). It 
was shown (67) however that a coin also feels heavier when its temperature is raised 
above skin temperature, which is hard to reconcile with the fact that the tempera- 
ture-responsive mechanoreceptors fire at lower rates when the temperature of a 
mechanical stimulus is raised. 

An alternative interpretation has been suggested: the temperature responsiveness 
of these mechanoreceptors does play a part in temperature discrimination, but it 
does so only in conjunction with a quantitatively different temperature characteristic 
of another class of receptor. If there were two classes of receptors that differed 
systematically from each other in their thermal and mechanical responsiveness, their 
outputs would then in effect present the central nervous system with a pair of 
simultaneous equations that it could solve to recover the information about either 
the thermal or the mechanical stimuli, or both, that were producing the discharges 
in two sets of afferent fibers. In the oral-lingual regions of monkeys, for example, 
two classes of temperature-responsive neurons have been described by Poulos & 
Lende (60, 61). The class of unit that responded to thermal but not to mechanical 
stimulation showed the typical inverted U function relating firing rate to tempera- 
ture that has been found in other units responsive to cooling. A second class of unit, 
responsive both to thermal and to mechanical stimulation, showed a dynamic 
response whose magnitude increased monotonically as the degree of cooling was 
increased. The purely temperature sensitive units, therefore, showed the type of 
ambiguity with respect to temperature that led to the hypothesis, mentioned earlier 
in this review, that the “bursting” activity observed in cold fibers may be an addi- 
tional code that serves to remove this ambiguity. In the reports just cited, the 
alternative “simultaneous equation” hypothesis is proposed: smaller temperature 
changes near skin temperature are signaled by the units responsive only to tempera- 
ture, while larger increases in cooling could be signaled by the units that are sensitive 
both to mechanical and to thermal stimulation. The information that the signals in 
these units refer to temperature would be given by the presence of simultaneous 
activity in the temperature-only units. Presumably when the temperature-only units 
were silent, the central nervous system would interpret activity in the thermally- 
sensitive mechanoreceptors as representing mechanical stimulus information. 

The evidence that the Type I receptor takes part in such a scheme is negative. 
Its temperature sensitivity in the cat and monkey (15) does not predict the facts of 
temperature discrimination. The Type I population in the cat is not subdivided into 
the two or more sets of sensitivities that would be necessary under a “‘simultaneous- 
equation” hypothesis (10). The other classes of thermally responsive mechanorecep- 
tors have not been studied in this regard. 

Recently reported thermal thresholds obtained by behavioral methods in humans 
and monkeys (41) have been interpreted to support Poulos & Lende’s hypothesis 
that thermal-mechanical units may participate in temperature sensation in monkeys 
and in humans. It was suggested that in human subjects the proportion of thermal- 
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only units involved is probably greater than that in monkeys. The electrophysiologi- 
cal and behavioral studies differed in several aspects of stimulation; for instance, the 
quantitative electrophysiological data were obtained from oral-lingual regions, 
whereas the behavioral data were obtained from the glabrous skin of the hand. As 
already noted in this review, electrophysiological and behavioral data can be com- 
pared with much more confidence when they are obtained either simultaneously 
from the same subjects, or at least with a serious effort made to have identical 
paradigms, parameters, and procedures in both cases. 


C-FIBERS 


During the review period there appeared a systematic survey in cat of the respon- 
siveness of receptors with unmyelinated afferents to a variety of graded stimuli (4). 
From previous reports it was known that this population includes mechanorecep- 
tors, temperature receptors, and receptors that respond only to noxious levels of 
stimulation. In the present study on the posterior femoral nerve about half of the 
sample consisted of receptors that respond either to weak mechanical stimuli or to 
temperature changes of less than 2°C, with the great majority being mechanorecep- 
tors. Only the temperature receptors ever showed background activity in the ab- 
sence of skin damage. 

About a third of the sample were polymodal nociceptors, a class of high threshold 
units; these responded to temperatures in the 42-56°C range, to moderately intense 
mechanical stimulation, and to irritating chemicals. These characteristics differ 
from those of the A-fiber nociceptors that respond only to damaging levels of 
mechanical stimulation (8). Another group of C-fiber receptors, not as large, had 
rather varied characteristics but had in common moderately high mechanical 
thresholds, and were possibly responsive to intense cold. Roughly half of the entire 
sample was thus found to require noxious levels of stimulation, while the majority 
of the remaining units responded to weak tactile stimuli, and a few responded to 
cooling or to warming. 

The C-fiber mechanoreceptors, although sensitive, are notably less capable than 
A-fiber receptors of reporting rapid stimulus changes (3). They respond preferen- 
tially to slowly moving stimuli, they fatigue with remarkable rapidity and recover 
with remarkable slowness, and are considered to be skin receptors, not hair recep- 
tors (although they do respond to hair movement, and to stretching of the skin). 
They also respond to being cooled, although not as vigorously as they do to mechan- 
ical stimulation. The response to cooling is similarly sluggish, fatigues rapidly, and 
recovers slowly; the fairly small degree of cross-adaptation between the two modes 
of stimulation suggests that there is a moderate degree of independence of the 
mechanisms involved (24). 


VESTIBULAR SENSITIVITY 


Vestibular sensitivity was the sole topic of the somesthesis chapter in the 1970 
Annual Review of Psychology. Much of the important literature in the field appears 
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in technical reports, which were extensively surveyed in that review. The present 
review is confined to some of the single-cell and psychophysical studies that have 
appeared in the archival literature since then. . 

It has been shown that adaptation of nystagmus to prolonged constant angular 
acceleration does indeed occur (7). With the use of accelerations higher than those 
used in previous studies, 3-24°/sec?, it was found that adaptation sets in after half 
a minute, regardless of stimulus intensity. This agrees with the previous literature 
regarding subjective estimates of acceleration. 

Recently published data provide lower estimates of the absolute thresholds for 
angular acceleration in male human subjects (12, 13). Thresholds of the oculogyral 
illusion in 32 normal men for rotation about the yaw (swivel-chair) axis had a mean 
value of 0.11°/sec* with-a range from 0.04 to 0.28. Thresholds of perception in 53 
normal men had a mean value of 0.41°/sec® with a range from 0.05 to 2.20. The 
correlation among thresholds was not statistically significantly different from zero. 
In a study of 18 normal men, the same mean threshold was obtained for the yaw 
axis, a similar threshold value for the roll (cartwheel) axis, and a mean value of 
0.67°/sec* for the threshold of the rotation about the pitch (somersault) axis. (This 
last value was not statistically significantly larger than the values for the other two 
axes.) There was little correlation among the thresholds, so that measurement of 
thresholds for one axis is not a useful predictor of thresholds for the other axes of 
rotation. 

Single-cell data have been reported that suggest that semicircular canal receptors 
can respond to rotation of a linear acceleration vector around the head without 
angular acceleration, such as when one is on a platform that is hung from four 
parallel cables and is swung in a circle (1). The investigators are careful to point out 
that they were probably recording from second-order afferent fibers, and therefore 
they cannot exclude the possibility of modulation by a signal dependent on sequen- 
tial activation of macular receptors. 

Centrifuge data have been reported that qualitatively support the idea that a 
man’s perception of his body position depends primarily on utricular shear forces 
only up to 1 g, because the predicted effect on perception of body position at greater 
g values is greater than the obtained effects (62, 74). The investigators are not 
inclined to believe that vestibularly deficient men can perceive their body position 
in the absence of nonvestibular cues. 

Our understanding of vestibular sensitivity in mammals has been considerably 
extended by studies in first-order afferent fibers of the squirrel monkey (19). The 
neurons have resting discharges ranging from a few to over 200 impulses/ sec, with 
a mean value of 90. Rotation of the horizontal canal in the direction that deflects 
the cupula toward the utriculus increases the discharge rate; for the vertical canals 
deflection of the cupula toward the utriculus decreases the discharge rate; rotation 
in the reverse directions decreases the discharge rate below the resting level. The 
effect is asymmetric: the decreases tend to be smaller than the corresponding in- 
creases. A number of the units gave no evidence of sensory adaptation, so that their 
response to constant angular acceleration was that to be expected as the cupula’s 
elasticity returned it to its resting position. There were other neurons that did give 
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evidence of sensory adaptation. The mean sensitivity to angular acceleration was 
about two impulses per second per degree per second squared, but ranged from a 
quarter to double this figure. The changes in firing rate at threshold must be quite 
small. 

Sinusoidal stimulation showed that the dynamics of the system are more complex 
than those of a simple torsion pendulum (16). Four time constants, rather than two, 
are required in the transfer function for a linear system that will approximate the 
obtained data reasonably well. Sensory adaptation, and receptor sensitivity both to 
cupular displacement and to the velocity of the displacement, are believed to be 
among the factors involved. Neurons tended to differ in the regularity of their 
discharge (20), but although certain other response characteristics such as sensitivity 
and response dynamics tended to be correlated with discharge regularity, the fre- 
quency distributions for these characteristics did not provide any clear evidence for 
the existence of more than one receptor population. 

The responses to static tilt of peripheral neurons innervating otolith organs in the 
squirrel monkey have also been reported (17). Neurons believed to innervate the 
sacculus had a resting discharge around 45 impulses/sec, those serving the utriculus, 
about 80 impulses/sec. The neurons responded either to pitch or to roll, and the 
response was approximately a linear function of the calculated shear force on the 
end organs. There was a slight nonlinearity of response, the increase above resting 
discharge being slightly larger than the corresponding decrease. 


CEREBRAL CORTEX 


The third somatotopically organized projection area in cat, in the dorsal part of the 
anterior suprasylvian gyrus, has single neurons with static characteristics similar to 
those in the first and second areas. They are modality-specific, with restricted, 
mainly contralateral receptive fields (14). Microelectrode exploration of somatosen- 
sory area I of unanesthetized monkey has produced the mapping principle of the 
“dermatomal trajectory” (76, 77). The principle explains why, for example, two 
adjacent peripheral receptive fields often have adjacent cortical projections, but also 
explains why they need not be neighbors on the cortical map. It lies beyond this 
reviewer’s powers to present a verbal explanation of the principle. The interested 
reader is referred to the illustrations and accompanying descriptions in the original 
papers. 

Single-cell recordings in somatosensory area II of unanesthetized monkey show 
this area also to be organized in terms of dermatomal trajectories, although somato- 
sensory area II is served by distinctly different pathways (78). Neurons in posterior 
portions of the area are polysensory (cutaneous, auditory, visual), respond to deep 
hard pressure, and possess wide, discontinuous, frequently asymmetric receptive 
fields. Cells more anterior respond to gentle cutaneous stimuli; ipsilateral and con- 
tralateral body halves are superimposed—a given cortical neuron has precisely 
matched and symmetric body areas in its receptive fields. 

The vertical columnar organization that one has come to expect of cortical 
sensory projection areas has been reported for somatosensory area I of anesthetized 
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cat. A vertical column was often found to consist of cells responding to manual 
touch, or to tap, or to hair receptors (42). In the middle and deep layers of the hand 
areas in Brodmann’s areas | and 3 of macaque, columnar segregation of slowly 
adapting and quickly adapting units tended to be further segregated by area, with 
most of the slowly adapting units being found in area 3 (58). 

Not to be outdone by visual cortex, somatosensory area I has yielded evidence 
in unanesthetized monkey of directionally sensitive cells (79). About one third of 
the sample of cortical cells were at least twice as responsive to precisely controlled 
stroking of the receptive field in certain preferred directions relative to other direc- 
tions. The necessary control procedure showed that this was not a characteristic of 
the responses in first-order afferent fibers, thus denying the possibility of differential 
stimulus effectiveness for different directions of stroking. 

The experiments on sensations of flutter and of vibration, and the peripheral 
receptors involved in them, have been followed up with studies of the responses of 
cortical neurons in unanesthetized monkeys (53). Cortical neurons served by the 
quickly adapting receptors that are involved in the sensation of flutter have back- 
ground discharge rates of 5-20 impulses/sec. As amplitude of a sinusoidal mechan- 
ical stimulus increases, the discharge rate increases in a smooth, monotonically and 
negatively accelerating function that is not substantially different for stimulus fre- 
quencies between 30 and 200 Hz. The fact that the plateaus of one-to-one time 
locking with the cycles of the stimulus which are seen in the first-order afferent fibers 
are not seen in cortical neurons suggests that several of the latter with different 
sensitivities converge upon the cortical neuron. There is, however, a probabalistic 
time locking of the cortical discharges to cycles of the stimulus, interspike interval 
histograms showing an increased time locking with increasing stimulus amplitude. 

The cortical neurons activated by the stimuli that entrain the Pacinian receptor 
types show very rapid increases in their discharge rates with increasing stimulus 
amplitudes, but do not show a temporal ordering of discharges corresponding to the 
frequency of the stimulus. This might lead one to expect that one could discriminate 
between frequencies of sinusoidal stimulation applied to the skin only in the fre- 
quency range to which the first of these two classes of cortical neurons respond. 
Psychophysical data obtained in the present study, as well as in an earlier study by 
Goff (18), show that frequency discrimination is possible in the frequency range to 
which the second class of cortical neurons responds. The investigators therefore 
suggested that the discrimination of the frequency of a flutter stimulus is based on 
the periodicity of the cortical neural discharge; they proposed that labeled lines— 
a somatosensory analog of auditory place theory—are the basis of differential per- 
ception of vibratory frequency. 


IN RETROSPECT 


In reading papers for this chapter, your reviewer has been impressed by how often 
investigators in psychophysical studies are forced to try to relate their data to 
electrophysiological data that were obtained under experimental conditions not well 
designed for the comparison, and how often the comparable problem exists when 
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one tries to relate single-cell data to psychophysical findings. The few exceptions 
encountered in which the psychophysical and electrophysiological experiments were 
planned to be complementary make clear that this is a much more efficient way to 
proceed. 
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INTRODUCTION 


If the human organism is indeed, as Gardner Murphy (99) wrote, a region of 
perpetual interaction, a structured locus of change whose boundaries are fragile and 
indefinite, then constant change and perpetual becoming claim title to man’s essence 
and to his personality. Yet many personality studies have focused on the organism 
at one particular moment in time to fix his qualitative and quantitative distinctions. 
While they are necessary and important, such cross-sectional studies of traits, 
attitudes, or motives have slighted the continuous instability of the organism, for 
personality never Js, it is always becoming. This hyperbole highlights an apparent 
paradox: although persons are continuously changing, they are also recognizable 
over time and thus to some extent stable. Previous reviews in this series have 
reflected a concern with structure and its measurement. This review, however, will 
focus on issues of change, on the adaptive tasks posed by growth, on changing 
oneself to fit into a changing ecology or changing the environment to fit oneself; in 
short, on the longitudinal perspective of personality. 

Personologists, for the most part, have shied away from longitudinal studies for 
well-known methodological reasons, some of which are: the investigation may out- 
live the investigator; new methods of observation and measurement may render 
obsolete those decided upon earlier; measuring instruments adopted for one age 
range may be unsuitable for another; and the characteristics studied at one age may 
no longer be present at a later age. Yet there have been a few intrepid investigators 
who, defying both these inherent difficulties and the Zertgerst, have presented data 
on personality development, on the constant becoming. These studies receive more 
than cursory mention in this chapter. Issues of theory have clamored for a hearing, 
and arguments over the essential nature of man—as automaton or as responsible 
regulator of his own fate-—have appeared in several books and papers. This chapter 

The preparation of this survey was supported in part by research grant MH-19477 from 
the National Institute of Mental Health. The author is indebted to Randall Rowlett, who 
assisted most skillfully at all phases of this survey, and to Gardner Murphy, Robert Holt, and 
Herbert Schlesinger for their always helpful comments. 
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will review some of these arguments. Finally, some theorists have implicated envi- 
ronmental factors in personality change, but some obvious exogenous factors like 
chemical compounds have received only scant attention by personologists. The 
relationship of these substances to personality change will also be considered. 

Broadly speaking, information about human personality comes from four differ- 
ent sources of study. The first is that of biochemical and physiological dispositions, 
conceived to have genetic as well as environmental origins. The second is studies 
of morphology, of body type. The third is that of the societal realm, including the 
impact of social forces upon the growing child and the differentiated personal 
responses—motives, traits, behaviors, attitudes—which derive from societal 
sources. The fourth is the examination of clinical contacts with people who have 
suffered adaptive and adjustive failures and from such observations—they are gener- 
ally observations in depth and over time-—we have obtained most of our knowledge 
about the dynamics of adaptation. 

Current psychoanalytic thinking exemplifies the clinical source; role theory and 
group dynamics stem from the societal realm. Although in the past the morphologi- 
cal body-build concepts have had significant impact on personality theorists (e.g. 
136), there has been a noteworthy decline in interest in such concerns. There is 
visible increase in the numbers of biochemical and physiological approaches; these 
include not only such early studies as Schachter’s (131), but the growing literature 
on behavior genetics (46, 124), biochemical aspects of psychopathology (e.g. 112) 
and the short-term and long-term chemical alteration of moods, cognition, and 
general adaptation. From the perspective of the societal approaches as well as from 
the clinical, lively debates have occurred concerning the nature of man—whether 
he is to be conceived of as dominant over his surround, relatively autonomous of 
his social context (27, 86, 153), or a reactive organism whose general behavior can 
be programmed or at least controlled or predicted by careful manipulation of 
environmental contingencies (97, 137). A further but related debate within this 
realm concerns the issue of the developmental continuity of behavior and traits. The 
most striking development emerging from the clinical and the societal approaches 
within the last few years has been the appearance of so-called humanistic personolo- 
gies, with a concern for such qualities as “fulfillment,” “ideal values,” and “self- 
actualization” (88, 121). This development shares the quasi-utopian vision of the 
more behavioristic approaches, either by ‘‘actualizing” man’s inner potentials (the 
humanistic approach) or by programming the environment (the behavioristic ap- 
proach). In contrast the clinical enterprise emphasizes the limits, inner and outer, 
on ultimate achievement, the ubiquitous presence of conflict and ambivalence, and 
a reliance on pathology to give clues to normal personality development. Efforts to 
reduce the complexities of personality theory and to simplify the classification of 
points of view within and between personality theories led Maslow to suggest that 
the study of personality can be approached from three vantage points: 1. behavior- 
centered, learning-dominated, stimulus-response oriented, or “behavioristic,” repre- 
sented by Skinner; 2. psychoanalytic, represented by Freud and Erikson; and 
3. the existential, humanistic, or “third force” view, represented by Maslow, Rog- 
ers, or May (88, 89, 121). 
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Each of these “schools” can generate research strategies, and these reflect their 
(a) implicit view of the nature of man, (b) order of priorities for the phenomena 
to investigate about personality, and (c) preference for methods suitable to bring out 
the effects they deem important. It would seem that the task ahead requires a unified 
field approach that will combine the viable techniques of all three views so that 
investigations can examine all the visible modalities of human personality in ways 
that do not slight any of the aspects of the person. Something like this amalgam has 
already occurred in the field of experimental psychopathology—which this reviewer 
regards as a branch of personology—whose investigators use operant tenchiques on 
animals to observe the behavioral effects of chemical compounds and then speculate 
about the genetic, biochemical, and learned basis for the behavior (cf. 112, 134). 


THEORY 


A major theoretical statement on personality—metatheoretical is probably the bet- 
ter description—appears in Chein’s The Science of Behavior and the Image of Man 
(27). Chein challenges both the positivistic position as represented in behavioristic 
approaches and the existential positions. While accepting the main features of both 
these approaches, Chein views them from an organismic-motivational-transactional 
perspective. He does not deny that highly predictable stimulus-response relation- 
ships can be seen when cognitive, affective, and motivational factors are ignored, but 
believes that the simplicity of the behaviorists’ scheme is achieved at the expense 
of arbitrarily eliminating the complexity of human behavior. The view of man as 
a helpless pawn either of environmental contingencies or of biological constraints 
rests on what Chein believes is a dubious assumption—that all the determinants of 
behavior are included in man’s constitution and in the environment. 

Chein’s central integrating concept is motivation, whose paradigmatic definition 
is “behavior is a motive of the behaviors it includes.” Thus, driving a car is a motive 
for turning on the ignition; and studying for an examination is a motive for going 
to the library. In this view the person is an active agent and not a passive medium 
for the interplay of constitution and environment. Since man’s own activities affect 
that interplay, motivation is a coequal determinant of behavior and along with 
constitution and environment, one of the three sufficient conditions. Behavior is thus 
free, according to Chein, to the extent that environment and constitution do not 
dictate or preclude a particular action. 

While modifiable, motives become interrelated with and interdependent on other 
motives. They become part of a complex network of relationships and the effect of 
this motivational complex is a relatively stable, quasi-self-sustaining system of mo- 
tives “commonly referred to as character and personality.” Chein elaborates this 
statement of the subject matter not only of personality but of most of psychology, 
and makes a compelling case for his view that man plays an active role not only in 
the “causal complex of the world about him but also plays a part in shaping himself 
both as psychological being and as biological organism.” Chein’s view of the per- 
sonologist’s task is to investigate the manifold behaviors possible when the purely 
constitutional and environmental factors have been minimized—that is, to investi- 


250 HOLZMAN 


gate the conditions and consequences of motivation. The phenomena that the radical 
behaviorists and the biological determinists view as the sole legitimate object in the 
study of man Chein relegates to a subsidiary and supporting role. 

The issue of purpose, which Tolman (148) alone among the behaviorists con- 
fronted as a knotty problem, Chein embraces as a necessary part of understanding 
behavior: if a motive emphasizes the dynamic effect of the superordinate on the 
subordinate act, purpose refers to the service or function of the subordinate with 
respect to the superordinate act. 

Although the enduring motivational structure seems to be bound to the body— 
for it moves with the body and cannot survive the body—it is not identical with the 
body nor is it contained in the body. Chein’s is a transactional view of motives and 
thus of behavior and personality. In contrast, Gordon Allport regarded the person- 
ality as being bound within a physical body; Chein argues that Just as abilities cannot 
be conceived of without the objects to be manipulated when the ability 1s activated, 
so motives cannot be conceived of as independent of their behaviors which involve 
objects. 

Chein also denies that the trait should be regarded as the ultimate unit of 
personality. Rather there are six basic concepts of personality: spontaneity, directed- 
ness, the environmental conditions for the fulfillment of action, the interrelationship 
of one act with another, the distressing character of drive states (arising from the 
unavailability of a course of action suitable to eliminate the drive state, “or in other 
words, the diffuse random discharge of neural energy into motor outlets of the body 
is per se distressing and that this is an ultimate psychological given”), and finally, 
the concept of the self as a feeling of the “'hereness in the thereness,” a locus in space 
and time. In contrast, a trait is simply a regularity of an abstracted aspect of 
behavior, the regularity depending on specifiable kinds of conditions of motive and 
situation, 

These strong positions vie with those of B. F. Skinner, whose views of the image 
of man have been stated in several places and again powerfully in Beyond Freedom 
and Dignity (137). The concept of “autonomous man,” Skinner avers, is an error. 
Environmental contingencies are everywhere and they produce behavior. Intention, 
purpose, choice, decision, and desire are all replaced by the idea of an active and 
shaping environment. “The environment not only prods or lashes, it selects ... It 
is now clear that we must take into account what the environment does to an 
organism not only before but after it responds. Behavior is shaped and maintained 
by its consequences.” 

In a text that views sympathetically a social behavioristic position, Mischel (97) 
defines personality as a field of study rather than as a property of people. The 
subdisciplines or related areas of human development, cognition, creativity, learn- 
ing, and behavior observation should all claim the personologist’s attention. Mischel 
vigorously and cogently criticizes psychodynamic approaches to personology but is 
not equally critical of the social behavioristic view—that a person is what he does. 
Mischel’s point of departure is behavioral specificity; since predictions of manifest 
behavior from presumed traits, dispositions, or motives are significantly less accu- 
rate than from a knowledge of the stimulus conditions, the proper object of study 


PERSONALITY 251 


is not the inner world of the person but the social context, the stimulus conditions 
which control the person’s behavior. This reasoning, of course, assumes that the 
principal task of the personologist is to predict actual behavior in a molecular sense. 
Thus, Chein’s and Mischel’s view of what personology is all about differ radically. 

White’s new text (154) continues his eclectic but essentially psychodynamic ap- 
proach, written in the tradition established by H. A. Murray. White observes that 
we are considerably more skilled in explaining and understanding how people 
become bogged down by pathology (although these explanations are generally post 
hoc), than in understanding natural and often successful adaptive maturation and 
growth. 

Clinicians study lives in trouble at close range and over long periods of time, thus 
accumulating much more data about lives that become bogged down than do the 
experimentalists about adaptive behavior from their comparatively brief contacts 
with their subjects. White attempts to describe general principles of growth and 
development and also to understand the way individuals are organized for their 
adaptive tasks, incorporating his ideas of competence and effectence motivation, and 
integrating family, the group, and the social as central dynamic systems. 

Dahlstrom (29) attempted to answer the perplexing issue of whether behavioral 
consistencies and inconsistencies—such as they exist-—may properly be ascribed 
either to personality structure or to environmental contingencies. Dahlstrom pro- 
poses the construction of a typology in which a species type need not be identical 
with any individual member of the species. The requirements for a sophisticated 
typology—in fact, of personality structure—would include (2) flexibility so that 
shifts in classifications could be made if a personality pattern changes, as it may in 
the course of maturation (17, 81, 82, 111); and (6) the development of a taxonomy 
of syndromes of dysfunction which recognizes that the dysfunctions and not the 
individuals are classified in syndromes. Dahlstrom rejects the conception of person- 
ality traits proposed by Allport, Cattell, or Diamond as the ideal structural variables 
in this system, for such conceptions of traits do not allow for intra-individual change 
and are not responsive to intra-organismic or environmental challenges in analogy 
to physical systems such as the cardiovascular system. Dahlstrom has no ready 
candidate for the component variables in this dynamic system to replace traits, but 
he does suggest that ideas such as the distinction between trait anxiety and state 
anxiety are promising (140), as is the concept of social “roles” (23). Personality 
studies have only attempted to measure the more enduring “‘quasi-stationary” (113) 
features of human personality, for instance the efforts of Gough, Cattell, or of those 
investigating moral character (74), perceptual and cognitive styles (41, 68, 157), or 
Piagetian stages (111). We do not yet possess the metrics for assessing intra- 
individual variation over short and long periods of time including those that can be 
interpreted as attempts to maintain stability in the face of internal and external 
pressures. In order to identify and refine attributes of personological types, Dahl- 
strom argues for methods that combine formal testing procedures and scaling 
techniques. These possibly mechanical techniques are to be tempered by the human 
observer who must check on the “reasonableness” of the personality taxonomy— 
the coherences—yielded by statistical techniques. 
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Mischel’s observation (97) seems valid that prediction of behavior from traits is 
poor and that discontinuity and inconsistency in behavior are as frequent as are 
apparent continuities. Block (16) has offered an explanation of such apparent dis- 
continuities. The preferred method, he notes, for isolating regularities has been the 
correlation of behaviors presumed to be related, and results have indeed been 
disappointing. Four reasons for these disappointments are cited: 1. The behaviors 
under study may be insignificant or not salient. 2. The context must be considered. 
3. The behaviors being studied (the phenotypes) may be mediated by different 
underlying variables (genotypes). 4. Whatever behavioral consistency exists may 
break down when a person has reached a personal limit of strain. 

Loevinger (80) approached the issue of stability and change in personality from 
the vantage point of “ego development.” Just as the upright position of a gyroscope 
is maintained by its rotation, she wrote, and thrusts and counter-thrusts maintain 
the shape of an arch, so the structure of the ego and its functioning in interaction 
with other people is maintained by its activity. A comparable point of view is found 
in Piaget’s idea of “mobile equilibrium”—the more mobile the structure, the more 
stable it is. The essence of ego functioning, in Loevinger’s view, is a striving to 
master, to integrate, and to make sense of experience. The development of ego 
functioning consists in a progressive differentiation of parts and a simultaneous and 
continuous hierarchization, in which the parts are subordinated to the organic 
whole and organized and regulated by laws of the whole organism. Thus, empirical 
studies which demonstrate that people change or do not change with respect to some 
behaviors do not become proof or disproof of an ontology. Rather they highlight 
questions of what remains stable and what becomes transformed. Like a variation 
on a musical theme, the new version emerges from its earlier form, resembles it, and 
yet is different from it. 

Loevinger observes that most theories of personality view development dialecti- 
cally, and view growth not as linear and continuous but as involving turning points 
or stages. Growth of a set of qualities increases up to a point and then either declines 
or is eclipsed by the growth of another set of qualities. The new set may not only 
oppose the previous one but also incorporate aspects of it, and in this process may 
replace it and itself be changed by it. This process of dialectic epigenetic develop- 
ment is characteristic of the theories of Piaget, Freud, Kohlberg, as well as Loev- 
inger. She refers to this kind of theory as “a milestone process” in contrast to the 
“polar conceptions” of development. Milestone theories tend to be typological and 
allow for manifold arrangements of characteristics, as well as for descriptions of 
persons who do not advance beyond certain stages. Loevinger’s own theory of 
personality or ego epigenesis (79-82) is strongly influenced by the holistic orienta- 
tion and dialectic stance of Piaget, Freud, Werner, Baldwin, and G. H. Mead. Her 
labels for the successive stages she has distinguished are “autistic” (in which the 
infant is oblivious to everything except physical needs), “impulsive” (in which the 
infant is a creature of impulses and is concerned with self-demands and the demands 
of others in order to secure satisfaction), “‘self-protective” (which in small children 
is ritualistic and in older children may be opportunistic, manipulative and exploita- 
tive), “conformist” (in which rules are obeyed because they are rules and in which 
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the overwhelming concern is being loved and approved, but in which the conception 
that others have an inner life is extremely limited), “conscientious” (in which others 
are now seen as differentiated in terms of feelings and thoughts), and “autonomous” ` 
(where tolerance for the ironies of life are realized). Her descriptions of these stages 
emphasize the common elements that all persons either experience in passing 
through or ir arresting at a particular stage. Each of the stages is both a normal 
developmental period and a point at which a particular individual may be “located.” 
Among adults there are representative examples of any particular stage. No age 
boundaries have yet been described for children. This theory—as well as those on 
which it is modeled—contrasts with the “polar theories” which view personality 
development in terms of a single motive, striving, or quality such as that of “self- 
actualization” (45) or “competence” (153). In Loevinger’s view “polar theories” 
illustrate “the ontological fallacy” in that they assert that all people, however 
phenotypically different they seem, are basically driven by the same motive. These 
theories assume that the motive that may characterize one late stage of development 
can explain behavior at “all previous stages and of the transitions from stage to 
stage” (80, p. 117). Thus, in the “polar theories,” transformations of motivations 
are either denied or ignored, while such transformations are taken as given and 
indeed are the problem for explanation in the “milestone theories.” 

Loevinger and her co-workers (81, 82) have constructed a test and a set of scales 
for measuring the successive stages of ego development. These scales are in wide- 
spread use as a research tool in studies of personality development and functioning. 
The test involves the completion of 36 sentence stems, and is designed for either 
group or individual administration. Almost all of the published research involving 
this test has been with women and girls 12 years and older, but work on the 
standardization of the sentence stems for males and younger people is in progress. 
Reliability was satisfactorily high (between .75 and .85) as estimated by median 
interrater correlations for single items and for total protocols. Validity of the test 
as measured against clinical interviews was about .60. 


THE LIFE CYCLE 


The problem of continuity in development is of central concern in the major theoret- 
ical statements of the past year. It would therefore seem helpful to marshall the 
evidence for and against them. The remainder of this chapter will allocate studies 
to two broad categories: those dealing with adaptation in the course of the life cycle, 
and those dealing with adaptation in the face of exogenous factors. In the former, 
we will consider evidence of stability and change in the course of growth and 
maturation; in the latter, the stability of personality under the impact of such factors 
as drugs. 


Early Childhood 


An abiding concern of developmental personologists has been that of continuity in 
various traits and abilities, not the least of which is those behaviors loosely subsumed 
under “intelligence.” A significant review of this area by McCall, Hogarty & Hurl- 
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burt (90) made use of the longitudinal data gathered at the Fels Research Institute. 
The authors noted that Bayley’s assessment of early studies of mental growth of 
infants failed to establish predictive validity of scores earned in the first year or two 
of life, although these scores may have been a valid index of cognitive ability at the 
time of testing. These poor predictions, it is assumed, were not the outcome of the 
unreliability of the tests, and McCall et al, using the Fels’ data, determined that at 
6 months and at 12 months parental education alone is the best predictor of child- 
hood IQ. There is almost no predictive advantage in considering infant test results 
until about 18 months, when incorporating results of infant testing with parental 
education and socioeconomic status in a multiple regression equation slightly im- 
proves prediction of childhood IQ. Early vocalization is the subtest that predicts 
later IQ best, a finding that is more reliable for girls than for boys. The uneven 
predictive power of infant tests and the instability of correlations across ages sup- 
ports neither the concept of a g factor of mental development, nor a conception of 
a skill-specific, age-specific factor. The authors suggest that 


within any age (or developmental level), one could discern subsets of skills that would 
be highly related within a set but the subsets would be independent from one another. 
Some of these skill sets might possess relationships with similar or qualitatively different 
skill sets at a future age; others might lead to a ‘dead-end’ in terms of their developmental 
significance. 


Such an interpretation resembles the Piagetian and Eriksonian epigenetic schemes 
of development. McCall et al suggest that 


However, there are alternative interpretations of the content of this developmental trend. 
The exploration of objects, imitation of adult behavior, and displays of verbal fluency all 
require the infant to possess a type of social extraversion that permits him to interact freely 
with a strange examiner. Thus, the major dimension underlying normal infant mental test 
performance may not be ‘mental’ at all but rather a social inhibition versus extraverted 
personality style. Another alternative is that the developmental trend observed in the 
infant behavior may actually reflect trends and consistencies in parental behavior that are 
then reflected in the child. A mother who plays contingently or provides toys for potential 
contingent effects during the first few months of her child’s life may be the same parent 
who plays imitation games a few months later, and provides an adequate and encouraging 
language model for her child during the second year (90, pp. 742-43). 


Clearly related to the above findings and formulations are those of L. B. Murphy 
(100) who, with her colleagues A. Moriarty and G. Heider, over a number of years 
has been reporting longitudinal studies of 60 children. Her observations and tests 
of 32 of these 60 children focused on the ways the children dealt with problems of 
living at various ages, and the nature of the transitions and leaps in their develop- 
ment. Her emphasis is upon the development of individual styles of adaptation. 


Many of the children surprised us in certain ways, even while exhibiting much continuity. 
For example, a stoical and strong preschooler considered to be one of the ‘best copers’ 
in many ways became discouraged, embittered, and poorly controlled at the prepuberty 
stage; a very vulnerable infant became an unusually competent, understanding and well- 
functioning teenager; a forthright, and at times aggressive, preschool girl became a 
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sympathetic and a nurturant child at home and at school. These highlights suggest the 
range of areas in which change occurred, and the traumatic degree of these changes (100, 
pp. 96, 97). 


Changes in IQ ranged from 10 to 30 points; there were frequent spurts in growth, 
in temperament, and in response zones. As sources of change Murphy proposed 
genic differences, exogenous and endogenous traumata such as deprivations and 
illnesses, the idiosyncratic patterning of relationships between child and environ- 
ment, special vulnerabilities, and the special learning or reinforcement history. 

The concept of “coping” is central in Murphy’s theorizing about personality 
development and functioning. In her view the human organism actively strives to 
come to terms with endogenous and exogenous factors in his life-space. The relation- 
ship between the repertoire of a child’s coping techniques—in Murphy’s terms—and 
the psychoanalytic conception of “defenses” receives some clarification in one of her 
recent papers (101). Murphy notes that both defense mechanisms and coping mech- 
anisms are a part of the repertoire of the normal child and have a common source 
in the early behaviors of the infant. Yet coping is the more inclusive concept. There 
is a continuum between the earlier methods of active coping and the later methods 
of defense, some of which become patterned into defense mechanisms. By the age 
of 3 or 4 most children have already developed a repertoire of such mechanisms and 
these form a part of their normal coping resources. 

Normal children use defense mechanisms with considerable flexibility, and these 
appear neither as substitutes for active efforts nor as results of failures of active 
efforts. Murphy insists that defense mechanisms or specific coping devices neither 
inevitably lead to pathology nor guarantee normal development, the latter being a 
complex outcome of interactions between the balance of child’s vulnerabilities and 
strengths and their interaction with the expected and adventitious patterns of stress 
and support from the environment. 

D. G. Freedman (37) reported on an 8-month longitudinal twin study rating their 
filmed behavior. This study showed that the identical twins were substantially more 
alike than same-sex fraternal twins on several of Bayley’s scales. Freedman’s study 
eliminated the factor of mutual imitation inevitable in studies of older twins. While 
monozygotic twins are more similar than dizygotic twins on early interpersonal 
measures, the concordance is not maintained in later life. A second study using the 
Bayley weekly “Social Interest Scale” over the first 5 months replicated the earlier 
findings. These results provide further evidence for a significant genetic component 
in the basic behavior of social interaction. An empirical study of neonates from six 
different ethnic groups found each group unique in some way—generally in temper- 
ament. Similarities, however, were more apparent than differences. Another empiri- 
cal study found that female neonates smile more than do males and at 6 months they 
stare more at social objects. 

Thomas et al reported in 1963 (146) on 80 children studied in terms of nine 
behavioral variables: activity level, rhythmicity, approach-withdrawal, adaptability, 
intensity of reaction, threshold responsiveness, quality of mood, attention span, and 
persistence. The authors determined that initially identifiable characteristics of reac- 
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tivity are persistent features of the child’s behavior throughout the first 2 years of 
life. The category of adaptability showed an especially high level of interperiod 
correlation of stability. 

In order to measure stability, the authors divided the data into five 6-month time 
periods and employed three different forms of statistical analysis. The stability of 
“preponderance ratings” for each of the 80 children in all nine categories through 
the first 2 years showed interperiod agreement to be statistically significant for each 
child with 35.8% of the preponderant ratings being identical in all five periods 
(versus chance of 1.2%). Seventy-eight of the 80 children maintained a statistically 
reliable rank pattern in four or more categories over the first 2 years. 

The data were collected by means of parental interviews at 3-month intervals 
during the first year and at 6 months thereafter. The parents were questioned about 
concrete descriptive details, although the central concept of this group of investiga- 
tors is “temperament,” by which they mean a style of reaction rather than a specific 
kind of behavior or specific motives for behavior. The only category of temperament 
which did not show stability over time in this initial study was “activity.” 

Although later studies by this group (147), following up 136 children into the 
latency age, report that the originally noted characteristics of temperament of most 
children tended to persist into the latency years, the authors do not conclude that 
temperament is unchangeable. A factor analysis of the children’s behavioral profiles 
yielded three general kinds of temperament. A first factor, which included 40% of 
the children, was described as “an easy group,” characterized by positive mood, 
regularity in body functions, low to moderate intensity reaction, adaptability, and 
positive approach to new situations. Ten percent of the children formed a “difficult” 
group, characterized by irregular bodily functions, intensity of reaction, withdrawal 
in the face of new stimuli, slow adaptation, and negative mood. Fifteen percent of 
the children were included in a group that was “slow to warm up,” characterized 
by low activity level, initial withdrawal to new stimuli, slow adaptation, somewhat 
negative mood, and low intensity of reaction. Thirty-five percent of the sample could 
not be classified into any of the three factors. Forty-two of the children showed 
behavior problems severe enough to require psychiatric attention, and about 75% 
of this group came from the “difficult” cluster, while only 18% came from the “easy 
group.” The authors observed that different temperaments require different kinds 
of parental and educational handling. If a child’s characteristics and his environ- 
ment are in harmony, healthy development can be expected [cf. Escalona’s concept 
of stable patterns of experience (32)]. In regard to adaptation, this study considers 
both the effects of adaptability as a characteristic of the child and the necessity for 
the parenting adults to adapt to the child’s specific temperament. 

Havighurst (54) reported that knowledge of a child’s socioeconomic status is the 
best predictor of later behaviors and attributes including: IQ, level of education and 
occupation to be achieved, work attitudes, age at marriage, number of children, 
sexual behavior, and type of neurosis if any. Only during prenatal life and the first 
6 months after birth is the predictive value of social class inconsequential. If, indeed, 
socioeconomic status does predict neonatal! characteristics, the prediction is proba- 
bly based on the superior nutrition and mental care afforded by higher status. Social 
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class differences in social behavior by the age of 6 months were attributed to 
mother-infant interactions characteristic of different classes. 

Somatic and psychological factors were also integrated in a series of studies on 
the effects of enriched or impoverished experiences on brain cell anatomical struc- 
ture and chemistry of rats. Increases in number and size of synaptic junctions, in 
thickness of cerebral cortex, in activity of acetylcholinesterase, and in number of 
glial cells, were attributed to enriched environment for the first 4 to 10 weeks of life. 
These changes were not a function of increased handling, stress, or of hormonal 
variations (126-128). In a parallel set of studies, Sackett (130) was able to show that 
social responsiveness of monkeys was related to early social isolation, with males 
showing the effect to a greater extent than females. For humans, however, a study 
by Hertzig & Birch (58) suggests that a socially impoverished environment does not 
produce an inevitable and progressive decline in IQ. These investigators measured 
the IQs of 45 working class Puerto Ricans and 94 middle-class white boys and girls 
at age 3 and again at age 6. Both groups increased in IQ, with the white children 
increasing more. The issue of stability versus continuity remains unsettled by these 
studies. 

Another follow-up study of intelligence (69) reported on 24 men and 24 women 
of the original standardization populations of the Stanford-Binet and WAIS tests. 
Test scores were available from 1931, 1941, 1956, and 1969 for the Stanford-Binet 
and from 1956 and 1969 for the WAIS. For both tests stability in adulthood was 
rather high (.77 and .73). Correlations from preschool age to adulthood were low 
(.41 and .39). The mean IQs continued to rise over the years 39 to 44, with those 
men who had initially higher levels continuing to gain the most. Women, on the 
other hand, who had the highest preadult IQs gained the least in the adult years, 
suggesting that bright women tend not to fulfill their intellectual potential, a finding 
consistent with that reported in Block (19). 


Late Childhood 


A report by Kagan (67) may help clarify the problem of stability, continuity, and 
change in personality development. He, like others, noted that behavior in the first 
2 years of life is not a good predictor of later functioning in all environmental 
contexts. Kagan observed children in two Guatemalan Indian cultures and con- 
cluded that stimulus deprivation in early childhood resulted in only temporary 
cognitive lags; by preadolescence the children were demonstrating vigorous growth 
in cognitive skills. Thus, “separate maturational factors seem to set the time of 
emergence of basic functions, and experience can slow down or speed up that 
emergence by several months or three to four years, but nature will win in the end. 
The capacity for perceptual analysis, imitation, inference, language, deduction, 
symbolism, and memory will eventually appear in sturdy form.” Whatever retarda- 
tion deprivation produces is not as immutable as has been supposed. A parallel 
finding is the demonstration (141) that “depressed” monkeys—those reared in 
isolation and who show retarded and deviant socia] behavior—can be “cured” of 
their “depression” by being placed with younger female monkeys for a 26-week 
period. The forces of growth and maturation seem to have their own pace and 
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momentum; the organism is not totally the product of a tyranny of environmental 
shocks, praises, reinforcements, and other contingencies. 

Two interesting cross-sectional studies addressed themselves to the development 
of moral sense. One study involved 5000 children aged 10 to 14 from seven cultures, 
including U.S. blacks and whites, matched for socioeconomic status and sex and 
representing equal samples from each culture and grade level (143). The second 
study involved 115 American whites from kindergarten through the first year of 
college (144). The investigators report that notions of rules and laws are surprisingly 
similar across cultures. In six of seven cultures, the parents appear as the most 
powerful authorities, those able to make one follow a rule. The authors regard this 
finding as evidence that affiliative, nurturant strategies—prescriptive in nature—are 
more effective than punitive strategies—prohibitive—in inducing behavioral com- 
pliance. Tapp and Kohlberg described the children as progressing from a preconven- 
tional stage where rules are obeyed to a conventional stage where rules are 
maintained to a postconventional stage where rules are created. The modal stage for 
even the college youth was clearly conventional. These studies reflect an image of 
man that is less antisocial, uncontrolled, evil-spreading than generally conventional, 
rule-seeking, and social. 

Block (17) reported a longitudinal study of development utilizing data originally 
gathered for two earlier studies of adolescence and childhood: the “Guidance 
Study,” (84) and the “Oakland Growth Study” (65). Longitudinal data on these 
subjects continued to be accumulated, but there was no systematic attempt to 
organize, codify, and interpret them until Block’s investigation. From these two 
studies 171 subjects returned for further intensive testing in adulthood. A variety 
of data was available through adolescence for both groups of subjects, including 
school grades, teacher reports, ratings of social behavior, interviews, IQ scores, 
Rorschach protocols, TAT protocols, subjective reports, peer reports, and newspa- 
per clippings concerning the subjects. New follow-up data from adulthood was 
derived from interviews totaling an average of 12 hours each in which the subject’s 
life, career, marriage, early life experiences, and family interactions were examined. 
Also administered, but kept separate from the later analysis, were the Rorschach, 
TAT, California Psychological Inventory, MMPI interview data with the spouse, 
free play ratings, early family ratings, and interviews with the subjects’ mothers. All 
these data were not available for each subject, however, and Q-technique was used 
to evaluate the large mass of data. The task was to describe 171 adults as examined 
by a number of psychologists over a number of years including during their adoles- 
cence, and over about 3 days when they were in their 30s. Correlations between the 
composite Q-sorts at junior high school age, senior high school age, and at adult- 
heod measured global personality change in the subjects. Mean correlations between 
junior high school and senior high school ages were .77 and .75 for males and 
females respectively. Between senior high school and adulthood they were .56 and 
.54 for males and females respectively. But the wide range of correlations indicates 
that a number of subjects changed a good deal while others remained relatively 
stable. The male “nonchangers” for both transition periods seem to be self-confi- 
dent, quick, resourceful, and vigorous, in contrast to the “changers” who seem to 
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be dependent, unsure of themselves, tense and guarded, For the females, however, 
this finding is reversed with the changers eager to chart their own destinies, compe- 
tent and full of confidence. Yet even in the most extreme changers continuity is 
evident. From a factor analysis of the adult Q-sorts, Block derived six female and 
five male types, and then traced the subjects’ lives back to the earlier years to come 
up with composite developmental profiles and types. As a general finding, by adult- 
hood children fused the characteristics of their parents, favoring the more active 
parent. A child’s maladjustment tended to be correlated with maladjustment of the 
opposite-sex parent rather than with that of the same-sex parent. Patterns of impulse 
control were related to the way in which the parents handled the child’s impulsive 
behavior. Father seemed to be equally as important as mother in the child’s charac- 
ter development. Low intelligence seemed to be more predictive of life outcome than 
was higher intelligence. 

Jeanne Block studied socialization of sex roles in several samples of American 
children and of children from six Northern European countries (19). The results 
showed a surprising cross-national consistency in sex-role socialization. Little boys 
were encouraged to achieve and compete, to become independent, control their 
feelings and restrict expressions of emotion, to follow the masculine “agentic” ideal 
(6). Little girls were encouraged to develop and maintain close and interpersonal 
relationships, to talk about their troubles, to reflect upon life, and were given 
comfort and reassurance, to follow the feminine “communal” ideal (6). 


Adolescence and Young Adulthood 


A study by Block, Van der Lippe & Block (18) explored in some depth the interac- 
tion of high and low sex role with high and low socialization for men and women. 
High and low masculinity and femininity show marked differences when moderated 
by different levels of socialization. Each sex role, in western culture, has associated 
with it certain positive and negative qualities. Thus the female role implies conserva- 
tion and nurturance, but also inhibition of impulse, submission, anxiety, and the 
capacity for guilt. The male role implies initiation and instrumentality, but also 
domination, aggression, impulsivity, and egocentrism. For men, socialization has a 
broadening effect in that it joins the allegedly masculine quality of competence and 
instrumentality to the allegedly feminine quality of interdependence. For women, 
on the other hand, socialization has a more restricting effect and necessitates the 
renunication of achievement and autonomy. Thus low feminine-low socialized 
women were actively assertive and engaged in cognitive pursuits. It appears that 
socialization allows men to reap the positive aspects of both sex roles and encourages 
the disavowal of negative masculine components. For women, on the other hand, 
socialization reinforces sex-role identification and denies them the positive mascu- 
line attributes. 

An extended discussion of sex-role aspects of femininity appears in Bardwick’s 
excellent text (7). She reviews and comments on studies of sex-role behaviors and 
arrives at conclusions similar to those of Block et al (18). Bardwick’s concerns, 
however, range across issues of feminine identity, self-esteem, motivation for 
achievement, and motivational change. 
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In an empirical study of 1051 student radicals and conservatives and their par- 
ents, Block (20) sought to determine whether student protest was the result of 
unresolved oedipal conflicts, as suggested by Feuer (35) and Bettelheim (14), or the 
outcome of socialization patterns within families politically active and liberal, as 
suggested by Flacks (36), Keniston (70, 71), and Smith, Haan & Block (139). The 
data were collected in 1964 and in 1967 when student protest was considerably more 
prevalent than today. Many students held values similar to their parents’ while 
rejecting societal values. Yet a significant number were both activists and held 
attitudes discordant with those of their parents. The authors concluded that student 
protest can stem from a variety of sources, including oedipal rebellion and congruent 
socialization patterns. 

In 1965 Heath (55) attempted to validate a construct of maturity which involves 
five developmental lines: increased stability, integration, allocentrism, autonomy, 
and the availability of experiences to awareness. He examined the 10 most and 10 
least mature Juniors and seniors at Haverford College as selected by members of the 
faculty, administration, and student body. The subjects were interviewed and given 
tests including the Rorschach, TAT, Adjective Checklist, the author’s own Self- 
Image Questionnaire, the California F-Scale, the Stroop Color-Word Test, and the 
MMPI. Predictions derived directly from Heath’s model of maturing were generally 
confirmed by test results: mature men were shown to be significantly more stable, 
congruent, allocentric, and ‘accurate in their self-image. Interview data generally 
supported the maturing construct, that is, immature men reported more incidents 
of parental emotional instability and tended to be more critical of their families, had 
fewer and less important achievements to report and described themselves in less 
positive and goal-oriented terms. Mature men surpassed the immature men in 
academic achievement although the two groups did not differ in intelligence. The 
mature men participated more fully in all phases of the extracurricular and athletic 
programs, appeared to have many more affectionate and close relationships, and 
made fewer visits to the infirmary. Of considerable interest is the finding that both 
the mature and the immature men experienced regressive periods of disorganization, 
and Heath concluded that effective adaptation does not preclude temporarily re- 
gressing to more autocentric modes of behavior. A more mature person can permit 
himself to become temporarily unstable, disintegrated, and autocentric if adaptation 
requires such regression [cf. Kris’ concept of “regression in the service of the ego” 
(75), and Schafer’s elaboration of that idea (133)]. In a later study, Heath (56) 
followed up the personality changes of the Haverford students over a period of 
several years to test his model of maturity in the developmental process, to study 
various types of maturing effects, and to explore in a longitudinal fashion the fate 
of the students. He judged that the most important maturational change in college 
was the progressive integration of students as influenced by their interpersonal 
relationships. A follow-up study found that the maturing effects of college persisted 
and sometimes was even more evident after graduation. The foremost change re- 
ported was the resolution of the “identity crisis” and stabilization of the self-concept 
around central determining and integrative values rather than around interpersonal 
relationships as predicted from the undergraduate data. Heath stresses the role of 
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“supportive disorganization” in the development of mature values and intellective 
skills. Freshmen tend to become unstable and disorganized when first confromted 
with the new environment of college but they eventually become more stable and 
autonomous. Heath’s studies, like Neugarten’s formulations (see below), document 
development as a continuing process, taking place over a long time span, and not 
as a mere endogenous unfolding maturation. Personal relationships and experiences 
have powerful leveling and shaping effects. It is important to note, moreover, that 
the same environment that had a positive effect on the more mature students had 
a retarding effect on the immature sample. 


Adulthood 


Most studies of personality development consider adaptation and maturation from 
the early years up to early adulthood. Even as late as 1968, in a book devoted to 
the development of personality (78), the author devoted 9/10 of his book to the years 
from birth to young adulthood, thus giving only 1/10 of the space for the rest of 
the life cycle. Neugarten (102) commented, “One cannot help but wonder why we 
have had such a long prologue and first act, only to find that we have no play to 
watch. What happens in that period of life, that is——at least for most middle-class 
people—a great second act? And what happens in the third act, which the author 
himself describes as the climax of middle age? And is there—surely not—no move- 
ment in the final act...” 

The central idea of a collection of papers devoted completely to parenthood (4), 
and strongly oriented toward psychoanalytic theory, is that the parental role is a 
changing one that follows an epigenetic sequence. Not only has the parental role 
evolved phylogenetically, but it continues to evolve for each parent in interaction 
with the growth of the child. Jessner, Weigert & Foy (63) examined women’s 
conflicting motivations about pregnancy and the attitudes and unconscious fantasies 
that are experienced prior to the birth of the child. A number of transformations 
are observed, especially during the first pregnancy that correspond to the physiologi- 
cal changes of each trimester. A study of the effect of pregnancy on 29 husbands 
was undertaken by McCorkel (92), whose major interest was to define the nature 
of the crisis in becoming a father and the adaptive responses to it. He distinguished 
three groups of husbands: romantically oriented, career oriented, and family ori- 
ented. As with the expectant mother, the husband’s attitudes changed during preg- 
nancy, depending significantly on his dominant orientation toward marriage and 
pregnancy. Anthony (2) examined both normal and pathological parental responses 
to children in the oedipal phase. He observed that the great majority of parents are 
unaware of their involvement in their child’s oedipal transition, though reactions 
may range from amused awareness to a revival of the oedipal crisis in the parent. 
Kestenberg (72) studied the effects on parents of having to share their latency-age 
child with the community. She states that in this transitional phase the child 
threatens the parents by making demands that they change their attitudes to suit 
his needs. A major measure of the parents’ acceptance of their child’s latency is their 
willingness to surrender the child to the teacher. The child’s own widening social 
contacts also guides the parents’ socialization. Anthony (3) proposed five contempo- 
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rary stereotypic reactions of parents to adolescents: the adolescent as a dangerous 
and endangered object, as a sexual object, as an object of envy, as lost object, and 
as a maladjusted individual. He argues that the response of any parent to an 
adolescent may be dictated by three factors: a collective reaction based on stereo- 
types, an idiosyncratic reaction based on the personalities and the experiences 
involved, and a “transference” reaction based on pre-existing factors from an earlier 
stage of the parents’ life. 

According to Neugarten (104-108), in the middle and older years chronological 
age as a variable changes in significance. It is the use of one’s experiences which 
prove to be significant markers in the development of adult personality functions: 
the ways one confronts work, love, time, and death; the changes in self-feeling as 
a function of marriage, parenthood, retirement, widowhood, and illness. Mere speci- 
fication of chronological age in personality studies without reference to such inter- 
vening experiences and their mastery contributes little to the study of lives and is 
a poor predictor of life outcome. Thus Neugarten’s explanation (103) for the ineffec- 
tiveness of long-range predictions from childhood or adolescence into adulthood 
rests on a failure to acknowledge the importance of such intervening variables as 
marriage, parenthood, job failure, or widowhood. The salient issues of childhood are 
no longer salient during adulthood, and it is inappropriate to predict vocational 
success from school success, adult pathology from childhood withdrawal, or adult 
attainment from early retardation. 

Rosen & Neugarten (122) found that “ego energy” as measured by the TAT— 
and defined as the ability to integrate wide ranges of stimuli, to perceive and deal 
with complicated or conflictual situations, to perceive affects as important in life, 
and to be actively assertive—shows a decrease with increased age. Neugarten refers 
to this change as increased interiority (104). No relationships, however, were found 
between ego energy and sex or socioeconomic status. In a cross-cultural longitudinal 
study, Gutmann (50, and 105) confirmed his earlier finding that with increased age 
there is a trend—as measured by TAT stories—away from active realistic mastery 
styles to regressive, passive, and magical] styles of dealing with conflict. This finding 
probably reflects an increasing difficulty to manage one’s inner life and to think 
logically in old age. 

Havighurst (53) noted that the psychological response to retirement depends on 
one’s attitude toward work. Data indicate that retirement does not cause ill health, 
rather ill health is a critical factor in many decisions to retire. Busse (22) noted the 
serious impact on personality of the loss of hearing, a primary means of maintaining 
contact with environment: there is a tendency to become suspicious and paranoid. 
Misunderstanding by others of a partially deaf person’s hesitation to respond can 
precipitate or speed major disruptions in interpersonal relations. Significant differ- 
ences between the Rorschach protocols of older persons with and without hearing 
loss have been found. Busse also suggests that those who survive well in old age are 
self-centered individuals, less involved with other people and better able to accept 
the loss of a spouse or child. 


PERSONALITY 263 


EXOGENOUS FACTORS 


Although personologists have considered culture, environment, and family atmos- 
phere in personality growth and development, they have generally neglected a 
number of other exogenous factors such as nutrition and drugs. 

The effects of chronic malnutrition can be seen in stunted growth and in develop- 
mental lags in intellectual functioning. Physically there is a relationship between 
chronic malnutrition and the reduction in number and size of brain cells, extent of 
myelinization, structural changes in the brain and spinal cord, and a number of 
pathological central nervous system lesions (15). In rhesus monkeys a protein- 
deficient diet produced behavioral deficits (158). Thus effects of malnutrition on 
personality development and functioning would seem to be considerable, although 
these effects have not been precisely catalogued as yet. 

Public attention has recently focused on the use and abuse of pharmacologic 
agents, and scientific interest in the personality predispositions to drug abuse and 
the short- and long-term effects on personality functioning of a number of these 
substances has also increased. 


Predispositions 


ALCOHOL The most familiar toxic substance abused by man is alcohol. There ts 
wide agreement that young people—adolescents—are introduced to alcoholic bever- 
ages in their own homes (87) and that adolescent problem drinkers come from 
homes in which one or both of the parents are heavy drinkers (49). Peer experiences 
also influence the development of drinking patterns (64). Of considerable impor- 
tance as well are religious sanctions—-proscription or prescription—concerning 
drinking. Jewish customs prescribe consumption of alcohol and indeed regulate its 
intake even to the extent of prescribing periods of excess, thus controlling its intake. 
In Protestant usage alcohol is either forbidden or unregulated and thus uncon- 
trolled. It is therefore understandable that studies have shown that heavy drinkers 
among young adults or adolescents come from homes in which the parents are heavy 
drinkers, have friends who drink heavily, and either have weak religious affiliations 
or belong to a religious group that proscribes drinking. 

There have been many attempts to describe the personality qualities of heavy 
drinkers. The clinical literature has emphasized qualities of the “essential alcoholic” 
which include “low anxiety tolerance, generalized avoidant tendencies, egocen- 
tricity, weak introspection, and demanding, dependent relations” (132). Only a few 
studies of this pattern have yielded findings statistically valid. Williams (156), on 
the basis of an adjective checklist, described the male college student heavy drinker 
as anxious, impulsive, aggressive, with little concern for others and with a low 
opinion of himself. Competence and achievement are generally lower in the adoles- 
cent drinker than in the abstainer. Jones (66) reports a similar constellation for 
female students who are heavy drinkers. McClelland (91), using the TAT, con- 
cluded that the young alcohol abuser is highly concerned with his sense of potency 


264 HOLZMAN 


and that drinking seems to increase the subjective feeling of power. Mogar et al (98) 
studied 101 male and 100 female alcoholic patients using the MMPI. The authors 
identified five distinct types among the female sample (normal manic, normal de- 
pressive, hysterical, psychopathic, and passive-aggressive) and six distinct types 
among the male sample (passive-aggressive, depressive, compulsive, schizoid, pre- 
psychotic, and passive-dependent). Each type, the authors conclude, needs different 
therapeutic intervention. 


NARCOTIC DRUGS Most studies report that the users of heavy narcotics such as 
heroin come from broken families (123, 149) and are members of ethnic minorities 
(13, 25, 149), living in a densely populated urban area with significant impoverish- 
ment and disorganization in the environment. Personality correlates of narcotic 
abusers include immaturity, low frustration tolerance, a tendency to avoid anxiety 
rather than to cope actively with frustration, superficial interpersonal relationships, 
and a lack of clear goals and healthy resources (43, 76, 118), qualities which 
clinicians have recognized on the basis of less systematic observations. Recent 
studies have proposed a contagious disease model for the spread of some narcotic 
addictions such as heroin use (62). Such a model would require alignment with the 
findings about specific personality predispositions to addiction. It may be that 
contagion is of greater importance in certain highly disorganized areas of the inner 
city, while in less deprived sections of a metropolis personality disposition is a 
principal determinant. 

Farnsworth (34) distinguished between students who are curious about drugs, 
experiment with them, and then abandon them, and those who are seeking to solve 
their problems with drugs. The latter group is characterized by feelings of inade- 
quacy, disillusionment, and fear of intimacy. Freedman (39) also suggests that many 
of these drugs offer “a lure from the acquisition of competence,” a way of shortcir- 
cuiting personal growth and avoiding confronting the complexity of reality. Hinck- 
ley et al (60) studied 451 students attending a university psychiatric facility and, 
using the MMPI, compared a number of characteristics of drug users and nonusers. 
Drug users came predominantly from the college of liberal arts and the graduate 
school. Students with active religious affiliations had a lower percentage of use. 
Thirty-seven percent of the students rating their mental health as poor were users. 
The users tended to be nonconformists with antisocial feelings and barely suppressed 
feelings of anger. Robbins et al (120) reported that drug users at two colleges 
described themselves as moody and unhappy. 


BARBITURATES Anumonye (5) studied 20 barbiturate-dependent subjects and 20 
nondrug-dependent controls. Self-administered questionnaires, psychological tests, 
structured interviews, and an examination of case records show that the addict 
group was psychiatrically ill in ways similar to the control group, with a common 
personality type and no definable psychological characteristics which would distin- 
guish the addicts from the nonaddicts. 
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MARIHUANA Brill et al (21) report that greater use of marihuana is related to 
higher scores on stimulus-seeking scales and the psychopathic deviate scale of the 
MMPI. More males than females report daily use. While users of marihuana mani- 
fest greater psychopathology than do nonusers, those who used marihuana as well 
as other drugs show even more impairment (51, 52). Yet with marihuana smoking 
increasing, it is not clear whether these trends will persist. 


PSYCHOTOMIMETIC AND ANALEPTIC DRUGS McGlothlin & Arnold (93) reported 
the result of a 10-year follow-up study of 247 persons who had taken LSD in either 
a therapeutic or experimental setting. They were reinterviewed 10 years later, and 
comparisons were made between subjects who used LSD only under the direction 
of a therapist or experimenter and those who had continued to use LSD on their 
own. The authors concluded that LSD attracts persons who have a tendency to 
experience naturally occurring mind-alerting experiences and to value intuition and 
a casual, spontaneous style of life. Smart & Fejer (138) concluded that among the 
users of LSD, behavior disorders and psychoses were more frequent than would be 
expected by chance. There were strong indications, however, that much of the 
psychopathology predated the use of LSD. The LSD users were also underachievers, 
though their IQs and ages were similar to those of the nonusers. Welpton (151) 
studied 10 chronic LSD users by means of psychiatric interviews and psychological 
tests. Eight subjects also had extensive experience with amphetamines. The author 
concluded that the subjects used LSD to relieve their chronic despair and to find 
some meaning in life. They hoped to be open to all sensations, to find what was true 
and genuine in life, to be more aware of things around them, and to become more 
creative. By temporarily breaking down self-object boundaries with the drug, they 
sought to achieve intense intimacy with others. Their use of LSD supplied a group 
identity, made them feel cared about, and also provided a purpose to life: to prosely- 
tize for psychedelic experience. A significant characteristic of the users of hallucino- 
genics is their clannishness and cultist behavior. Another investigator found that the 
personality dispositions of these users include poor modulation of hostility, in- 
tropunitiveness, low self-confidence, excessive passivity, and poor cognitive orga- 
nization (38, 40). 

Bell (9) obtained histories from 34 of a group of 40 amphetamine addicts and 
determined that one or both of two classes of stressful events preceded the addic- 
tions: the rejection by or separation from a person on whom the patient was emo- 
tionally dependent, and a transition to a more demanding adult role (independence, 
heterosexual intercourse, marriage, parenthood). The addicts themselves, however, 
denied the importance of these events. The population of users and abusers of 
psychotomimetic drugs in contrast to the users of narcotics is predominantly middle 
and upper-middle class (52, 73). A high proportion of users have had parent losses 
early in life (44). One study reports that in the families of high users there is a 
prevalent atmosphere of emotional instability, with inadequate setting of limits on 
children, poor tolerance for anger, and generally unempathic relationships among 
family members (73). 
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In a study of the relationship between personality and addiction, Chein (26) 
concluded that neither maladjustment nor personality structure per se 1s sufficient 
to account for addiction. Two other sets of determinants are involved: influences of 
social climate, attitudes, values, stresses, and gratification current in the community. 
Addiction is most probable if the use of opiate drugs is adaptive, functional, or ego 
syntonic. These changes operate at four levels within the individual: the level of 
conscious experience, of certain defenses, of unconscious processes, and of psycho- 
physiological reactions. 


Acute Effects of Drug Ingestion 


There are numerous reports that narcotic, stimulant and hallucinogenic drugs in- 
hibit productivity, creativity, and meaningful interpersonal relationships. Once vul- 
nerable students begin taking such drugs, their entire lives revolve around them (34). 
Barbiturate intoxication dedifferentiates mental structure and has significant effects 
on memory. Six normal subjects were trained on a series of tasks involving different 
aspects of memory and attention and their individual sensitivity to secobarbital was 
determined. They were tested at several levels of the drug on a double-blind basis, 
and replicable dose-response curves were produced. When the test demand was 
great, even high doses had no apparent effect. When the demand load was moderate 
or mild, a dose-related drug effect was obtained. One investigator proposed that 
barbiturate addiction is far more destructive to personality structure than is opiate 
addiction (77). 


MARIHUANA The studies of marihuana thus far are limited to acute effects, and the 
results are not as consistent as are those dealing with the narcotics, the barbiturates, 
and the psychotomimetic drugs. Brill et al (21) report that marihuana is considera- 
bly safer than alcohol or barbiturates. It causes no physical dependence or tolerance, 
although it may produce psychic dependence and habituation. Some clinical studies, 
using an average of 2 grams of marihuana containing one percent delta-9-tetrahy- 
drocannabinol (THC) found little if any impairment on simple psychological and 
perceptual-motor tests. Renault et al (117), however, in a study of the development 
of tolerance to marihuana, reported a dose-response effect on heart rate and a 
decrement in time estimation, although there was some improvement in perfor- 
mance over time. After repeated administration, five of seven subjects believed 
incorrectly that the potency of the drug increased over time, providing some evi- 
dence of an accumulation effect. Some toxicity was evident in four of the subjects 
who complained of perceptual distortions and a tendency toward disorientation and 
confusion. One subject underwent a psychotic reaction and became confused about 
his identity. The level of toxicity seems to have been related both to dosage and 
emotional susceptibility, for the subjects receiving the strongest dosage (2.8% THC 
twice a day), and the one subject with a history of depression “were those who 
displayed the most abnormal behavior.” Marihuana, however, displays no clinically 
obvious withdrawal symptoms, and all drug effects were reversed immediately with 
a return to placebo. 
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ısp A study of 30 experimental subjects and 20 controls had subjects perform a 
variety of psychological tasks while under LSD and placebo. Typical LSD effects 
included difficulty in thinking and in controlling thoughts, especially in directing 
attention, loss of control over emotions and behavior, a feeling of numbness in the 
body, nausea, perceptual changes in size, shape, weight and proportions of the body 
image, loss of contact between self and environment, and difficulty in speaking. 
Generalized slowing up appears to be one of the reliable effects of the drug (8). 


AMPHETAMINES Cox & Smart (28) report that the acute effects of amphetamine 
ingestion are rapidly felt. The subjective experiences include an initial sharp arousal 
of intense pleasure and immediate physiological changes in blood pressure, pulse, 
and pupil size, and blurred vision. Symptoms such as insomnia, appetite loss, 
micturition difficulties, thirst, and increased energy are accompanied by heightened 
activity and alertness and a feeling of euphoria. These writers report that chronic 
ingestion of amphetamines can lead to memory impairment, severe gastrointestinal 
symptoms, lethargy, depression, and paranoid psychosis. Griffith & Oates (48) 
administered 10 mg of amphetamine every 4 hours to four healthy male volunteers 
for as long as the subjects could tolerate that amount. All developed a brief paranoid 
psychosis within 5 days, and one subject developed a psychotic reaction within 24 
hours. Euphoria was a common response to early administration but disappeared 
when the cumulative dose reached 50 mg. A 1972 study (47) attempted to replicate 
this finding by eliminating possible confounding variables such as prolonged sleep 
deprivation, combined drug effects, or latent psychosis. Nine adult male volunteers 
with previous high-dose amphetamine experience were used. Psychological testing 
showed no indication of borderline psychosis, schizoid personality, or organic brain 
damage, but some sociopathic tendencies were noted. A 6-week drug-free hospitali- 
zation and a one-week control period preceded the intravenous administration of 
10 mg of dextro-amphetamine sulfate (D-amphetamine is 10 times more potent than 
L-amphetamine) and subsequent hourly oral doses of 5 to 10 mg when they could 
be tolerated. Four subjects were kept on the drug until a clearly defined psychosis 
was observed. One of the two subjects who received the 5 mg hourly doses experi- 
enced a brief paranoid reaction. Of three subjects withdrawn from the drug during 
the prepsychotic phase, one subject nevertheless developed an acute psychotic reac- 
tion. As in the 1968 study, all subjects appeared depressed when the cumulative dose 
exceeded 50 mg. Paranoid delusions were common in all subjects developing a 
psychosis, but there was no evidence of auditory or visual hallucinations. Psychotic 
behavior ceased when the drug was discontinued, in most cases within 8 hours, 
although in one case suspiciousness persisted for several days. Ellinwood (31) re- 
ported similar symptoms including hypersexuality and visual illusions. 


PHENCYCLIDINE Meltzer et al (96) reported that phencyclidine (Sernyl) produced 
in all subjects a proprioceptive disturbance, followed by a disorder in thinking like 
that seen in schizophrenic psychoses, thus confirming an observation made by Luby 
et al (83). Neither the thought disorder nor the observed rise in plasma creatine 
phosphokinase (a muscle enzyme reported increased in acute psychoses) in some of 
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the subjects was observed in the toxic conditions resulting from other drugs such 
as amphetamines. 


MALNUTRITION The lack of critical nutritional substances has been shown to have 
profound effects on personality and behavior. Birch & Gussow (15) reported that 
the single most common behavioral finding in malnourished children is apathy, 
accompanied by irritability. The child’s ability to respond appropriately to signifi- 
cant stimuli in his environment is reduced during the period of chronic malnutrition, 
and continued malnutrition is accompanied by progressive behavioral regression. 


Individual Differences in Response to Drugs 


Most investigators report differential responsiveness to drugs. Anderson (1) noted 
marked variation in the ability of different individuals to develop tolerance to 
barbiturates. Renault et al (117) reported marked individual variations in subjects’ 
response to marihuana. The most extensive controlled investigation of individual 
responsiveness to drug use, however, has been the study by Barr et al (8), which 
demonstrated that each person reacted in ways that were highly congruent with an 
independent assessment of his defenses, coping resources, cognitive styles, and other 
aspects of personality structure. Changes occurring under drug intoxication state 
were related to personality dispositions in the predrug state. The subjective reports 
and statistical analysis of behavioral and psychological test data distinguished sev- 
eral patterns of personality organization and associated drug response. The descrip- 
tions of these patterns are full, rich, informative, and represent a model for 
integrating objective test data with clinical material. Another team of investigators 
(95) noted that subjects who placed strong emphasis on structure and control, used 
extremes for self-description, had high scores on scales of severity of Judgment and 
dogmatism, and high scores on the active-aggression and achievement-competition 
scales of the TAT, and who had high GSR tended to respond minimally to an LSD 
experience. On the other hand, subjects who scored high on a hypnotic susceptibility 
test, preferred an unstructured life style, and displayed greater esthetic sensitivity, 
imaginativeness, passivity, and noncompetitiveness, responded most intensely to 
LSD. The latter group also tended to report more enduring effects. Widrow (155) 
also reported that LSD effects are determined by pre-existing character structure, 
personality state, and circumstances associated with drug taking. Ellinwood (30) 
observed that although there is a dose-response relationship with amphetamine, 
amphetamine psychosis occurred more often in schizoid than in antisocial subjects. 

There seems little doubt that the effects of drugs is a complex interaction between 
the pharmacological activity of the drug, the dosage, the physical stamina, and 
personality characteristics of the subject, and the situation in which the drug is 
taken. 


Long-Term Effects of Drugs and Malnutrition 


No scientific studies have reported on long-term effects of marihuana or hashish (e.g. 
145), although there are anecdotal reports of personality changes occurring with 
chronic marihuana use (21). Indeed, the aggregate of evidence overwhelmingly 
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suggests that just as excessive, long-term use of alcohol has deleterious physical, 
social, and psychological effects, so might marihuana (and any other drug). To 
extend the comparison, marihuana seems to be no more eroding of personality 
stability than is the temperate or occasional use of alcohol (159). Yet the effects of 
long-term heavy use have not been sufficiently studied so that antecedent and 
consequent factors can be clearly distinguished. But the subtle behavioral effects of 
marihuana intoxication moved Freedman (39) to comment, pending further study 
of chronic use: 


Marihuana does produce a disinhibition which can affect motor skill and judgment. A 
person high on marihuana does not, however, show ataxia; this means he does not signal 
others about his potentially poor judgment ... Nor does marihuana signal the user with 
gastritis, nausea, and hangover to restrain his input ... Accordingly, daily trips—escapes 
from reality—cou/d readily occur as a habit leading to disregard for reality. 


Barr et al (8) report no lasting effects, either positive or negative, from their 
experiment with LSD. The most systematic follow-up study of drug use is that of 
McGlothlin et al (95). Of 72 male graduate students who were followed up at 6 
months, 24 had received 200 micrograms of LSD in each of three sessions. Fifty- 
eight percent of the experimental group subjectively reported some lasting effects 
after 6 months. This was consistent with a finding that in that group GSR in 
response to stress situations dropped. However, psychological changes were not 
impressive. Since the subjects who reacted most strongly to the drug experience were 
also those who were most likely to report enduring effects, it is difficult to separate 
motivation to change from enduring drug effects. Another study (94) found evidence 
of minimal brain damage in heavy LSD users. Significant impairment on the Hal- 
stead Category Test followed heavy ingestion of LSD, although clinical examination 
did not reveal significant signs of organic impairment. In this study also it was not 
possible to separate antecedent from consequent conditions, nor was it possible to 
separate the effect of a persisting toxic effect from a more enduring chronic brain 
syndrome. In McGlothlin & Arnold’s 1971 study (93), the majority of those subjects 
who took LSD under medical advice felt that their LSD experience had resulted in 
some lasting effects, but the proportion reporting change on individual variables 
over time did not exceed that of the control group. Those who continued to take 
LSD without medical advice, in comparison to the control group, demonstrated 
large and consistent differences on subjective and objective measures, a finding that 
suggests that LSD does not so much produce change as attract a particular kind 
of person—-evidence again for a significant interaction of drug effects and motiva- 
tion. 


OTHER EXOGENOUS FACTORS 


The measurement of aggressive behavior and aggressive inhibitory tendencies, de- 
scribed in a set of experiments by Olweus, took account of the social context, 
personality of the aggressor, and stimulus conditions (109). The author devised an 
“aggression inventory” which may be found useful by personologists. Holzman & 
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Rousey (61) demonstrated that preventing a person from hearing his own voice 
while he speaks has a disinhibiting effect, particularly in those people who manifest 
a leveling-repressive cognitive-control disposition. Their experiment suggested that 
different personality structures require a different quality and quantity of stimulus 
input for smooth development and functioning, a “psychoecology of everyday life,” 
as it were. 


THE HIGH-RISK METHOD 


The persistence of strong circumstantial evidence for the genetic transmission of a 
vulnerability to many forms of psychopathology (46, 59, 124, 125) has stimulated 
the use of the “high-risk” method of research. This method seeks first to identify 
those who are assumed to be vulnerable to a disease, disposition, or defect, and then 
to observe which ones of this population actually develop the pathology. Measures 
taken prior to the manifest appearance of the pathology, and which distinguish those 
who do and do not succumb, can yield important information on the nature of 
vulnerability. Garmezy (42) has succinctly described several strategies for selecting 
subjects for vulnerability research: genetic (on the basis of a parent’s history of 
disease), psychogenic (on the basis of manifest symptomatology already present in 
the child), sociogenic (on the basis of presumed noxious social conditions), and early 
neglect (e.g. birth complications, nutritional defects). The mere presence of a men- 
tally ill parent in the home has pathogenic effects on the child. Whether this 
vulnerability is a function of genetic or family-interaction factors or interactions of 
both is a question posed but not definitively answered by a study of 253 families with 
652 children. The authors (119) reported that the mental illness of mothers has more 
of an effect on children than does the mental illness of fathers. Preschool children 
escaped the effect of the ill parents more easily than did older children. In children 
of patients hospitalized with tuberculosis there was virtually no serious upset. 
High-risk methods will be used increasingly and will become major tools in person- 
ality and psychopathology research. Their effective use, however, depends upon the 
observation of meaningful variables. 


NEW TEXTS 


A number of new personality textbooks appeared in 1972, Maddi’s Personality 
Theortes (85) has been revised with expanded treatments of Angyal, Murray, Mas- 
low, and McClelland. Maddi & Costa (86) have written an illuminating source book 
on Allport, Maslow, and Murray as representatives of a “humanistic” tradition in 
personality theory. B. Weiner (150) has contributed a thoughtful treatment of 
motivation. It is indeed more than a text on motivation and can vie for a place in 
personality courses. Wenar’s (152) book views personality from a developmental 
vantage point and gives strong emphasis to the early years. Rotter, Chance & Phares 
(129) have written a text emphasizing the social learning view associated with 
Rotter. It contains copious experimental applications of the “internalizer-external- 
izer” typology. A small but meaty text on personality theorists—particularly role 
theorists—is that by Heine (57). L. Rappoport (114) has offered an interesting 
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developmental personality text which spans the life cycle although it devotes only 
17% of its pages to the years after 25. Razran (116), by expanding his classic paper 
on Soviet psychophysiology (115), provides us with a valuable key to the literature 
emerging from the post-Pavlovians. A major test by Cattell (24) gives the reader an 
in-depth survey of the psychology of abilities and general intelligence from a physio- 
logical, phylogenetic, and ontogenetic view, with explanatory psychometric 
schemata. 


CONCLUSION 


Resolution of prominent issues in personality studies requires methods adapted to 
the problems. Changes, development, emergence, stability—these are some of the 
areas whose elucidation is limited by cross-sectional studies. Until recently, how- 
ever, the shortcomings of longitudinal techniques have hindered use of prospective 
studies. New developments in method—perhaps incorporating the suggestions of 
Bell (11) and of Schaie & Strother (135) to combine cross-sectional studies with 
scheduled reappraisals to yield information about age-related changes-—no doubt 
will spur increased usage of longitudinal techniques in the near future. More per- 
sonologists may show a willingness to grapple with the complexities of longitudinal 
methods rather than content themselves with neat but time- and significance-limited 
reports. Separation of antecedent from consequent events, the understanding of 
effects of environmental contingencies, and of adventitious and self-administered 
endogenous factors, await the wider adoption of the longitudinal view of persons. 
Some speculative statements about man as free agent may not be sustained by the 
new data. It might well turn out that environmental factors will affect different 
persons differently, depending upon their genic endowment, constitutional factors, 
the meaning of particular adventitious factors to them, their coping and defensive 
repertoire, and the social context to which they adapt. 
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BRAIN FUNCTION: Changing Ideas «218 
on the Role of Sensory, Motor, and 
Association Cortex in Behavior 


R. B. Masterton and M. A. Berkley 
Department of Psychology, Florida State University, Tallahassee 


INTRODUCTION 


The classical three-way subdivision of cerebral neocortex into sensory, motor, and 
association areas is the product of a convergence of experimental and clinical 
evidence over almost a century (cf. 203). However, in the last 20 years, evidence 
difficult to reconcile with this idea has begun to accumulate. For example, we now 
know that neural activity in sensory cortex does not just mirror receptor activity, 
nor is motor cortex the “keyboard of the mind.” Furthermore, we know that 
association takes place both in animals with little or no association cortex and in 
animals in which the normal complement of association cortex has been surgically 
removed (e.g. 235, 236, 333, 334). 

Because of these obvious difficulties with the classical view, it seemed worthwhile 
to review the main lines of inquiry that have tended to demand a more sophisticated 
view of the contribution of the various areas of cerebral neocortex. However, to 
provide a format that is easy to follow, we have retained the classical distinctions 
as the basis for subdividing the review itself. Therefore, there are six main sections 
dealing respectively with visual, auditory, and somatosensory cortex, motor cortex, 
and finally posterior and frontal association cortex. Each of these sections has been 
further divided along methodological lines (i.e. anatomy, physiology, and behavior). 


VISUAL CORTEX 


Research on visual cortex has always played a dominant part in theories of the role 
of cortical sensory areas, and certainly the recent past is no exception. Along with 
the invention of new techniques, this dominance has brought about a virtual explo- ` 
sion of research on visual cortex ranging from its anatomy and physiology through 


277 


278 MASTERTON & BERKLEY 


ablation-behavior experiments. Specific reviews of this vast literature can be found 
in Bishop (27) and in Weiskrantz & Cowey (425). 


Anatomy 


The cat and the monkey continue to be the most popular animals for anatomical 
investigations of “visual cortex.” In the cat, Otsuka & Hassler (306) have delineated 
the major cortical areas related to vision on cytoarchitectonic grounds, and have, 
in addition, related these areas to the varied fissural patterns found in cat brains. 
A more recent study by Sanides & Hoffmann (353) has confirmed previous descrip- 
tions and added some new details regarding subdivisions within visual and visually 
related cortex. These two studies provide the current anatomical touchstones for 
electrophysiologists and behaviorists working with cats. Relatively little recent work 
along these lines has been done on the already well-studied monkey striate cortex. 

A number of other animals [the tree shrew (95), hedgehog (214), and grey squirrel 
(167, 215)] have now been examined to determine the locus of the visual cortex on 
the basis of cytoarchitecture. In general, these studies have found that the largest 
area of representation is for the binocular visual field and that anatomical bounda- 
ries and functional boundaries (as derived from electrophysiological mapping exper- 
iments) coincide very well (167, 214). 

Classical descriptions of the thalamic projections to cortex have held that the 
cortical target area of the dorsal lateral geniculate (DGL) is exclusively striate 
cortex (321). While no evidence to the contrary has thus far developed for primates, 
a number of investigators have described extrastriate projections from DGL in the 
cat (23, 44, 125, 137, 293). Although past studies had suggested that area 19 and 
a lateral suprasylvian area also received DGL projections (124, 137), recent studies 
by Burrows & Hayhow (44) have shown that these projections are limited to areas 
17 and 18. 

Since the knowledge of the evolutionary development of the geniculostriate sys- 
tem can be expected to reveal some wide ranging organizational principles of the 
visual system, several investigators have studied the thalamocortical projections in 
the visual system in such animals as the guinea pig (77), grey squirrel (215), tree 
shrew (95), opossum (17-19), platypus (62), and echidna (61). These studies have 
shown a remarkable consistency in the retino-geniculo-cortical scheme. Thus, the 
cat continues to be the only animal in which an enigmatic extrastriate geniculate 
projection has been identified. 

The observation of an anomaly found in the DGL of the Siamese cat (162) has 
led to a number of investigations demonstrating that the projection from the eye to 
lateral geniculate nucleus is also abnormal in this animal. As will be discussed in 
a later section, this abnormality is also reflected in the projections to visual cortex. 

Beyond the geniculocortical projections, an entirely new picture is emerging 
regarding visual projections to cortex. A number of investigators have shown that 
the pulvinar and lateralis posterior nuclei of the thalamus project extensively to a 
belt of cortex surrounding striate cortex (95, 153, 175, 290). These thalamic nuclei 
receive their visual input from the tectum (95, 153). Thus, there seem to be at least 
two pathways from eye to cortex, one from retina to geniculate to cortex and the 
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other from retina to tectum to posterior thalamus to cortex. The consequences of 
these new anatomical findings are already being felt in the interpretation of ablation- 
behavior experiments and may provide the key to understanding the mechanisms 
underlying residual vision in animals in which area 17 has been completely ablated. 

Although a number of workers have searched for extrageniculate input to the 
striate cortex, only one study reports such an input (19). At the time of this review, 
except for the cat (44) and the opossum (19), no anatomical studies have reported 
a thalamic input to striate cortex other than from the lateral geniculate. 

Application of the new silver stains continues to reveal details of neuronal orga- 
nization that are producing reevaluations of electrophysiological and behavioral 
studies. Special attention is being paid to the laminar distribution of inputs (18, 67, 
194, 213, 231) and the internal connections within cortex (357). 

In a further effort to characterize “visual cortex,” recent anatomical studies have 
also been focused on the distribution of efferents from striate cortex. It has been 
shown that fibers originating in the superficial three layers of cortex project down- 
ward to terminate in cortical layer 5 (377) and also in the adjacent area 18 (376, 
404). Deeper layers in area 17 project to the superior temporal sulcus and the 
pulvinar nucleus of the thalamus. Another subcortical target of obvious significance 
is the tectum, which receives afferents from areas 17 and 18 in the cat. There is 
evidence that area 18, in addition to sending fibers across the corpus callosum (195) 
and to tectum (129), also sends a small bundle of fibers to the pons (138). In the 
cat, area 17 sends efferents to 18, 18 back to 17, and both areas to the Bishop-Clare 
area and to area 19 (124, 443). Several studies report species differences in the 
cortical recipient zone of efferents from area 17 (376, 461). Thus, in Rhesus mon- 
keys, areas 18 and 19 receive area 17 input while in Saimiri only area 18 receives 
such input (376). 

Considerable interest has been shown in the projections of striate cortex to the 
thalamus as a possible feedback pathway (130, 173, 183, 184, 291, 292), especially 
since there is scant evidence for an efferent pathway to retina in mammals. Although 
it first appeared that only area 18 projected to the lateral geniculate nucleus (183), 
autoradiographic methods have now confirmed an efferent projection from area 17 
to the lateral geniculate in the cat (184, 291). 


Physiology 


Mapping of the visual world onto visual cortex in various animals has continued 
and now includes, for example, the hedgehog (165) and tree shrew (216). Since the 
cytoarchitectonic borders of area 17 have been found to coincide closely with the 
borders of the visual fields determined electrophysiologically in these animals as well 
as Others (167, 214, 216), it now appears that the anatomical borders of area 17 can 
be used as a guide to determine the extent of the field projection, and conversely, 
that the visual field map determined by electrophysiological methods can be used 
to delineate the extent of striate cortex. 

The investigation of response characteristics in single neurons of visual cortex 
continues unabated (e.g. 64, 255, 297, 380, 382). In addition to making some 
modifications of Hubel & Wiesel’s (188) classification scheme (e.g. simple, complex, 
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hypercomplex), numerous investigators have been making quantitative measure- 
ments of still other properties of single units (28, 255, 297). For example, Bishop 
and his colleagues (28, 177) in a series of papers have manipulated such parameters 
as the velocity of the stimulus, and the size, the orientation, and the extent of 
binocular input. Among the significant contributions made by this group is the 
introduction of a new technique called the “activated discharge” method (27). While 
the technique utilizes existing technology and some old ideas in neurophysiology, 
it has proved a valuable method in revealing new properties of cortical neurons. The 
activated discharge is produced by continuously stimulating a cell through one eye 
and attempting to influence it by another eye. This method has succeeded in demon- 
strating inhibitory inputs even in single units that have little or no spontaneous 
activity and thus, has revealed that a much greater percentage of cortical cells are 
binocular than was once thought on the basis of stimulation of each eye alone (177). 

Many of the cells respond in a very specific way to binocular input. Several 
investigators (e.g. 9) have described cells in striate cortex of the cat that give a 
maximal response when the two monocular receptive fields are in correspondence. 
Furthermore, it was found that some receptive fields are in correspondence even 
though they are slightly disparate on each retina. These results provide a firm 
neurophysiological substrate for stereopsis, e.g. depth perception through the use of 
retinal disparity (9, 27, 30, 318). 

The evidence for columnar organization in visual cortex derived from electro- 
physiological studies continues to accumulate and now appears to be overwhelming. 
For example, in addition to the columnar organization of preferred orientations 
shown by Hubel & Wiesel (188), Blakemore (30) has demonstrated columnar orga- 
nization for retinal disparity and visual direction and, as mentioned previously, 
Hubel & Wiesel have shown binocularity columns as well using anatomical methods 
(194). The extent of functional interactions between these columns is now a subject 
of considerable interest (32). 

While a great many details of cortical organization have been worked out using 
extracellular methods, the capability for describing excitatory and inhibitory in- 
teractions that intracellular recording provides is yet to be extensively explored. 
Creutzfeldt and his colleagues (16, 78, 296) have been the most active investigators 
using this methodology, but progress has been slow and the technical difficulties 
currently appear to mitigate widespread use of this method. 

Given the tools of electrophysiology and the considerable backlog of quantitative 
findings on single cell properties in visual cortex, a number of investigators have 
turned their attention to problems of perception in an attempt to ascertain if single 
cells can code some fundamental perceptual properties, such as masking (294) or 
stability of the visual world (86). One possible mechanism to account for perceptual 
stability would be changes in cell properties related to changes in head, body, or eye 
positions. Although such a mechanism may be difficult to reconcile with current 
theories of neural processing in visual cortex, there have been a number of reports 
(86, 186) in which receptive field orientation selectivity seems to be modified by a 
change in body position. 

To determine how receptive fields of cortical neurons are built up, the receptive 
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field properties of cells in the several cortical laminae have been compared. For 
example, the electrophysiological finding that some cells in layer 4 have receptive 
fields more like geniculate cells has been taken as evidence that these cells receive 
input from the lateral geniculate nucleus more directly than do cells with dissimilar 
fields (27, 28). 

Another line directed toward understanding the synthesis of receptive field prop- 
erties in visual cortex has been followed by the Australian group. These investigators 
have confirmed that two classes of neurons, termed transient and steady-state cells, 
project from the retina to the lateral geniculate nucleus, and they were able to follow 
this dichotomy of cell types from the lateral geniculate to cortex (182, 394). In 
general, the steady-state cells appear to project primarily to simple cells while the 
transient cells project primarily to complex cells. The functional significance of these 
findings has yet to be realized but it may be a large step forward in understanding 
the bifurcated input to cortex from DGL in the cat. 

How cells come to have the receptive field properties that they do is a question 
of great interest to all neuroscientists. One popular approach to the study of this 
problem is investigating the ontogeny of the properties of visual cortex neurons (11). 
The early experiments by Hubel & Wiesel (189--192, 441) have shown the develop- 
ment of functional amblyopia in monocularly deprived (65, 88, 298, 342) animals 
and a lack of binocularly activated cells in the visual cortex of monocularly deprived 
animals when tested as adults (189, 190, 365). These experiments have been ex- 
tended using a different method, induced squint, produced by sectioning one of the 
extraocular muscles. With this procedure, it has been found both in cats (190) and 
monkeys (299) that there is a reduction in the number of binocular cells in visual 
cortex (68), and an amblyopia results. The question of recovery from this defect with 
subsequent correction is still a matter of some controversy (299), but because of 
these findings, the normal development of interocular alignment has enjoyed re- 
newed interest (363, 364). 

Probably the most exciting finding in recent years has been the demonstration that 
selective visual experience can modify the receptive field properties of cells in the 
visual system of cats. For example, Hirsch & Spinelli (178-180, 381) and Blakemore 
& Cooper (31) have shown that young kittens exposed for a few hours a day to only 
one contour (e.g. either horizontal or vertical edges or lines) show a preponderance 
of cells sensitive to that contour as adults, and a dearth of cells responsive to 
contours of other orientations. Just as in the induced-squint and visual deprivation 
experiments, a rather clear behavioral deficit has been demonstrated (122, 178). 
These effects appear to be irreversible if the animal is not given normal visual 
experience during the critical period. These findings provide a firm experimental 
base for demonstrating the plasticity of the nervous system and susceptibility to 
environmental influences (350). It is not unreasonable to assume that a similar 
plasticity will be found in other systems as well when appropriate selective-experi- 
ence methods are applied. 

Coincident with the experiments on squint and monocular/binocular deprivation 
is an observation on the Siamese cat (162) mentioned previously. For many years 
it has been widely known that Siamese cats have a naturally occurring squint. This 
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squint now appears to be due to a genetic anatomical anomaly in the visual system 
which consists of a misrouting of the fibers at the optic chiasma (68, 76, 162, 193). 
This defect results in an intrusion of ipsilateral fibers in the contralateral field. 
Because the anatomical projections of the visual system in the normal cat are so well 
understood, it has been possible to predict the field defect in the Siamese cat due 
to its anomalous projection. Several investigators using single cell recording tech- 
niques in lateral geniculate nucleus and cortex have demonstrated the field intrusion 
that was predicted by the anatomical investigations (68, 163, 193). These findings 
raise a fundamental question regarding the treatment of squint in humans. Treat- 
ment has often consisted of surgical correction, yet these findings in the Siamese cat 
suggest that rather than a muscular imbalance producing a squint, a neuroanatomi- 
cal peculiarity within the visual system may be producing the abnormal] eye position. 
If this is true, the value of surgical correction of squint is questionable. 

Recent anatomical demonstrations of efferents from visual cortex to superior 
colliculus have provided impetus for a number of electrophysiological studies which 
have evaluated the role of these efferents in determining the receptive field properties 
of tectal cells (272, 349, 393, 440). These studies suggest that many tectal cells lose 
their orientation specificity after removal of striate cortex. 

On the afferent side, while there is scant evidence for extrageniculate input to 
visual cortex, numerous investigators have demonstrated inputs to the geniculocor- 
tical system from mesencephalic structures (117, 181, 253, 305, 371, 372), The exact 
route of these inputs has not yet been established anatomically, however. 

In light of these diverse approaches to the visual system and the plethora of 
published findings on visual system properties, it is not surprising that a number of 
workers have attempted to generate pattern perception theories utilizing visual 
neuron properties already known to exist (10, 79, 277, 319, 320). While the predic- 
tive validity of these sophisticated theories has yet to be demonstrated, they encour- 
age one to think that a fundamental understanding of the neural basis for some facets 
of visual perception may not be too far off. 


Behavior 


Ablation experiments in which visual cortex is removed and residual visual abilities 
are measured continue to be the major source of information regarding the func- 
tional role of visual cortex. Although the early observations by Kluver (223) of a 
loss of pattern vision in monkeys deprived of visual cortex have been confirmed 
numerous times, his findings of virtually complete pattern blindness must now be 
tempered. There appears to be a greater capacity for vision in monkeys without 
striate cortex than previously realized (196, 311), and in the cat, primate-like visual 
defects can only be produced if several cortical areas outside striate cortex are 
included in the ablation (82, 100, 379). 

Humphreys & Weiskrantz (196) have reported that monkeys deprived of striate 
cortex are capable of accurate spatial localization, e.g. normal orientation in space 
and a considerable amount of eye-hand coordination for grasping objects. Other 
residual visual capacities have also been described in striatectomized monkeys (311, 
356, 357). Since these results suggest some kind of parallel processing of visual 
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information, it is satisfying to note that anatomists have now provided several 
extrageniculate pathways that could participate in vision not mediated by striate 
cortex (e.g. 95, 153, 276). Thus, a number of investigators have suggested that a 
considerable amount of geniculocortical-like visual discrimination is possible at 
subcortical levels, e.g. superior colliculus (7, 205) and accessory optic system (311). 
Alternatively, the retinal-tectal-pulvinar-cortex pathway has been suggested as the 
route mediating these discriminations (95, 374). 

When striate cortex is ablated in other animals such as cats (100), rabbits (462), 
hedgehogs (166), rats (69, 73, 136, 378), and tree shrews (205, 418, 419), the deficits 
on pattern vision are not quite as clear cut. These animals, all nonprimates, appear 
to be capable of nearly normal visual behavior without striate cortex. Whether these 
animals are processing visual information in alternate pathways in the same way as 
it is processed in striate cortex, or are utilizing alternate mechanisms as well as 
alternate pathways is a subject of continuing controversy (222, 418). For example, 
it is not impossible that an animal deprived of striate cortex solves certain pattern 
discrimination problems on the basis of other cues (such as temporal intensity) that 
can be derived by scanning the visual display. 

While it may be possible for an animal to discriminate patterns through the use 
of a variety of visual cues, it has been thought his capacity to resolve fine detail is 
a unique property of striate cortex (22, 71, 223). Numerous investigators have 
demonstrated a dramatic reduction in visual acuity in animals deprived of visual 
cortex and have also shown that small lesions confined to known projection areas 
of cortex will produce reductions in acuity predictable from the field position of the 
defect (71, 345). One troubling finding is this regard has been reported by Diamond 
and his group: they found no reduction in the tree shrew’s capacity to resolve fine 
gratings after the complete removal of striate cortex (419). The tectal-puivinar- 
cortex pathway could mediate this capability (418, 419). 

The exciting findings by Barlow, Blakemore & Pettigrew (9) and Bishop (27) on 
the processing of binocular information in cat cortex (cf. physiology section) has led 
a number of investigators to analyze the role of striate cortex in depth perception. 
While there are yet very few reports on this question, it appears that the cat can 
use retinal disparity to make depth judgements (119) and that removal of striate 
cortex results in deficits in depth perception in cats (271), but may not in rats (39). 

The purely visual function of striate cortex has been little challenged by the 
findings of the last few years. There are, however, a few studies which indicate that 
striate cortex may also participate in a number of other activities that are not 
obviously visual [e.g. avoidance behavior (128, 249-251, 259, 313), audition (383)]. 
But because reports of nonvisual functions of visual cortex are still rare, their 
significance relative to the total role of striate cortex in behavior is difficult to 
evaluate at this time. 

The role of the corpus callosum in the integration of the two visual hemifields 
(126, 362, 412, 413) and hemisphere differences (344, 422) in visual behavior are 
subjects of great interest as are the effects of training and interoperative experience 
on postoperative performance on visual discriminations (33, 34, 136, 317). 
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AUDITORY CORTEX 


The idea that auditory cortex is necessary for auditory sensation is now all but 
buried in the past. Useful reviews of the anatomical, physiological, and behavioral 
experimentation leading to this conclusion can be found in Neff (284, 285, 288), 
Erulkar (110), Sachs (352), Elliott & Trahiotis (107), and Masterton & Diamond 
(268). 


Anatomy 


The locus and extent of auditory cortex has now been delineated in a variety of 
animals other than cat (e.g. 239, 241, 243), but information on some important cases 
is still apparently lacking. For example, cytoarchitectonic, electrophysiological, and 
thalamic projection techniques still disagree in their conclusions on the extent of 
auditory cortex in monkeys (cf. 37, 423). 

The search for tonotopic maps within auditory cortex, so typical of the 50s and 
early 60s, has now dwindled with the conclusion that although tonotopicity exists, 
it is neither as precise nor as prevalent as it is in lower auditory centers [e.g. see the 
contributions and bibliographies in Woolsey (455), Goldstein et al (152), Miller 
(273), and Purser & Whitfield (331), and the discussion in Sachs (352)]. Anatomical 
experimentation, whether electrophysiological or histological, has turned instead to 
the identification of subareas within auditory cortex and to particulars of their input 
and output. 

On the afferent side, it is already reasonably safe to conclude that there are at least 
two thalamocortical auditory systems, one projecting from the medial geniculate to 
an auditory “core” area of cortex and another projecting from part of the posterior 
group of thalamic nuclei to a “belt” of cortex surrounding the core (96, 154, 433). 
Whether there exist further important subdivisions within these areas in all mam- 
mals (as there certainly is in cats) or clear physiological and behavioral differences 
in the contribution of these two afferent auditory systems has not been determined. 

The efferents from auditory cortex have also been investigated with the new silver 
stains. In addition to showing intrahemispheric (91) and interhemispheric (92) 
projections, recent studies have found tracts descending from auditory cortex to 
basal ganglia, ventral thalamus, midbrain tegmentum, and pontine nuclei, as well 
as reciprocating pathways to the major auditory structures in thalamus and mid- 
brain (93). The latter pathways can be thought of as part of the descending auditory 
system (87, 332); but the pathways to basal ganglia, ventral thalamus, and tegmen- 
tum are hard to reconcile with the classical view of sensory cortex. 


Physiology 


The electrophysiological analysis of auditory cortex has begun to advance rapidly. 
Brugge (42, 43), Goldstein (149, 151, 152, 164), Whitfield (397, 432, 433), and their 
respective colleagues have gone far beyond questions of tonotopicity, regularly 
including monaural and binaural sounds, clicks, tone ramps, etc., in their battery 
of test stimuli. Others have included specific vocalizations (120, 395, 448, 449) or 
have recorded activity in awake or behaving subjects evoked by a variety of stimuli 
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(12, 208, 209, 354, 355). This expansion of test stimuli and experimental conditions 
reflects the fact that many cortical neurons are not responsive to tones of any 
frequency and also reflects an undercurrent of thinking that the passive neural 
responses to tones and their simple combinations may have been overemphasized 
in physiological experimentation in the past (268, 456). The argument has been 
made that auditory cortex is concerned more with the features of sound sources than 
it is with the physical features of sounds themselves (268, 456). This physical vs 
biological (or sounds vs sounds sources) dichotomy of auditory features has been 
prompted as much by the results of ablation-behavior experiments as it has by 
electrophysiological experimentation itself. 

Goldstein and his colleagues have explored the possible functional significance of 
the columnar organization of auditory cortex (1, 150), and also have succeeded in 
making intracellular recordings (336). The variety of response characteristics of 
single units (“on,” “off,” “on-off,” “through,” “groupers,” “lockers”), and the 
variety of “best stimuli” (clicks, noise bursts, swept tones, etc.) seem to be very 
individualistic characteristics of cortical neurons that currently defy quantification 
(336). But it has been shown that most of these characteristics are the result of 
convergent interaction somewhere in the pathway from ear to cortex rather than in 
cortex itself. Therefore, it can be concluded that some features of the stimulation 
are being extracted at successive synapses in the auditory pathway. However, the 
best way to characterize these “features” is far from settled, and no one theoretical 
viewpoint has yet begun to predominate. 


Behavior 


A number of ablation-behavior experiments on auditory cortex have been performed 
recently. Neff’s reviews (285-288) of this literature through the late 60s reveal the 
reasons why research has drifted away from strictly psychophysical experimentation 
on tones. Summarizing the earlier work, it is reasonably safe to state that ablation 
of auditory cortex does not affect the capacity to discriminate sound vs silence, nor 
the intensity or frequency of tones in any mammal. Only sound localization and 
lateralization and certain complex (tonal patterns) or naturalistic discriminations 
(voice sounds) seem to be permanently and severely disrupted by cortical ablation 
(e.g. 70, 89, 90, 94, 267, 341). 

The lack of involvement of auditory cortex in the discrimination of the physical 
elements of sound (intensity, frequency) and its clear involvement in the biological 
elements (Sound localization, species-specific calls, etc.) has been demonstrated in 
a variety of mammals ranging from the primitive opossums (334, 335) to the highly 
specialized cats (e.g. 45, 75, 220, 303, 434) and monkeys (266, 398). But the results 
of the most recent of these experiments seem to show that even with these discrimi- 
nations, the deficit resulting from auditory cortex ablation is more “audito-motor” 
than “auditory.” That is, even though the deficit is confined to strictly auditory 
discriminations, the particular behavioral response required of the animal may be 
a crucial factor in determining whether or not the discrimination is disrupted after 
ablation (176, 268). 
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SOMATOSENSORY CORTEX 


Since Woolsey (454) showed that potentials evoked by somatosensory stimuli could 
be recorded from motor cortex and that movements could be evoked by stimulation 
of somatosensory cortex, the classic distinction between somatosensory cortex and 
motor cortex has been blurred. This fact is now often alluded to by use of the terms 
“Sm” and “Ms” in referring to what was once called somatosensory and motor 
cortex, respectively—the lower case letters reminding the reader that somatosensory 
cortex 1s motor too, and vice versa. Recently, research has been directed to discover- 
ing and describing features that distinguish the two areas and indeed, such features 
have been found. However, the idea of two relatively independent homunculi, one 
representing the dermatomes, the other controlling the myotomes, 1s probably 
buried forever. Therefore, the studies cited here and those cited under "Motor 
cortex” should be grouped together in any consideration of the subject. 


Anatomy 


The mapping of somatosensory cortex in mammals has almost come to an end with 
the conclusion that in primitive and advanced mammals alike, the body surface is 
represented at the cortex in an order-preserving map with more cortical surface 
given over to body areas that are heavily innervated. Therefore, in this most basic 
sense, the pioneering conclusions of Adrian have been verified, and the topology of 
the somatosensory projection has been satisfactorily explained (2, 270, 329, 330, 427, 
429, 430, 435-438). Of the many strictly anatomical questions still remaining, the 
significance of the columnar and laminar arrangements (426) and of the fine struc- 
ture within the cortex itself (314), and the question of which peripheral fiber popula- 
tions do and which do not gain cortical representation are now in the forefront (e.g. 
20, 80, 81, 121, 168, 169, 302, 312). Luckily, most recent discoveries have tended 
to simplify rather than to complicate the anatomical picture (eg. 155). Landgren & 
Silfvenius’ (232-234, 367-369) elaborate verifications that some peripheral Group 
Ia afferents (afferents from muscle spindles) gain cortical representation has shat- 
tered the old rule that proprioceptive activity does not reach the cortex directly. 
Similarly, the localization of a cortical area activated by stimulation of taste fibers 
has removed still another exception from the discussion (121, 351). 

But one discovery in particular will probably have far reaching consequences for 
future research into somatosensory cortex. It is now known that the dorsal column 
and the anterolateral spinothalamic systems are not the only spinal somatosensory 
pathways. There is at least one other system (spinocervical) which is as direct a 
pathway from body to cortex as the others and possibly more so (e.g. 106, 123, 359, 
417). This discovery may go a long way in explaining the persistence of fine discrimi- 
nation after lesions of dorsal column, medial lemniscus, or somatosensory cortex 
(359). 


Physiology 


The electrophysiological investigation of somatosensory cortex is just beginning to 
become concerned with “feature extraction” as that term has been popularized by 
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vision research (439). More usual as topics are its interactions with motor and 
association cortex (36, 197, 281, 343) and the relation of neural activity to conscious 
experience (verbal report) in humans. 

This latter line of research has been reviewed by Libet et al (246-248) who have 
uncovered several striking features of somatosensory sensation in humans including 
the facts that stimulation in somatosensory cortex must be persistent to evoke 
consciousness, that different patterns of cortical stimulation evoke different qualities 
of sensation, and that cortical stimulation is about equally likely to evoke natura! 
qualities as paresthesia. Other physiological reviews of somatosensation can be 
found in Hyvärinen (197, 198) and in the books edited by Kenshalo (221), by 
Kornhuber (see 248), and by Somjen & Nashold (see 247). 


Behavior 


The difficulty in controlling somatosensory and kinesthetic stimuli in moving ani- 
mals continues to plague behavioral research on somatosensory cortex. Unlike 
vision or audition, most behavioral experiments on the somatosensory system are 
forced to rely on qualitative judgments of abnormalities, or on measurable deficits 
in relatively complex motor activities, with the result that progress has been slow 
and painstaking. Among the more informative series of recent experiments are those 
by Semmes & Mishkin (396a) in monkeys, by Berkley (21) and Glassman in cats 
(131-134), and by Finger in rats (118). Perhaps most revealing for the present topic, 
however, is the fact that most authors continue to describe the deficits, or the lesions, 
as sensorimotor instead of either sensory or motor. This choice of terms clearly 
reflects the difficulty in segregating the two systems in the traditional fashion. 


MOTOR CORTEX (AREAS 4, 6, AND 8) 


The classical view of motor cortex holds that area 4 is the origin of a precisely 
somatotopic corticospinal or pyramidal tract and gains most of its input from area 
6, while area 6, in turn, gains its input from the association cortex surrounding it. 
It follows from this view that unilateral excision of area 4 should result in perfect 
hemiplegia, and, further, that because of the clear motor homunculus in area 4, a 
punctate lesion should result in paralysis for voluntary movement in one small 
muscle group. Perhaps more than others, the early work of Tower (405), of Travis 
& Woolsey (409), and especially of Glees & Cole (135) foreshadowed the difficulties 
that the old view would meet. But where research into motor cortex was once 
confined mostly to electrophysiological mapping and gross ablation-behavior experi- 
ments, it now includes highly sophisticated electrophysiological, neuroanatomical, 
and behavioral techniques and their several combinations. It will be seen that these 
more refined techniques have now rendered the classical view almost untenable in 
all animals except, possibly, humans. 


Anatomy 


The experimental silver-staining techniques have shown that area 4 gains a great 
deal of input not only from area 6 but also from the adjacent somatosensory cortex 
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(areas 1, 2, 3, and 5), and from the ventrolateral nucleus of the dorsal thalamus. 
Area 6 gains input from almost all other areas of cortex and also from the thalamus 
(e.g. 116, 340, 389-392). Perhaps more surprising are the demonstrations by electro- 
physiological recording of short-latency potentials evoked in motor cortex neurons 
by stimuli in almost every modality (8, 35, 66, 105, 147, 148, 210, 245, 264, 265, 
347, 399, 415, 428, 431). In most mammals, therefore, the projections to areas 4 and 
6 do not serve well to differentiate them from each other or to differentiate them 
from association, or even sensory, cortex. 

The output of areas 4 and 6, long known to include the corticospinal tract, also 
includes recurrent fibers to the sources of their input, fibers to the basal ganglia, and 
longer fibers to the brainstem motor and sensory nuclei (227, 229, 263, 278, 279). 
It is these last fibers, those projecting from “motor cortex” to sensory neurons at 
lower levels that have come as a surprise and have opened the possibility that the 
cortical motor system might have been derived from, and maybe still is, a descending 
somatosensory system with a strictly motor component only recently acquired in 
a few specialized mammals. 

In a long line of neuroanatomical studies Kuypers and his colleagues (226-230) 
have shown that the corticospinal tract, once considered the chief output of cortex, 
and thought to extend from cortex directly to motor neuron in the spinal cord, is 
a rare exception, not the rule, among mammals. In most mammals, the corticospinal 
tract hardly exists, in the sense that it does not extend further than the uppermost 
segments of the spinal cord and synapses almost entirely on somatosensory and 
internuncial neurons rather than on motor neurons. Only in mammals that have the 
ability to move their digits independently, (e.g. anthropoids, racoons, phalangers) 
does the corticospinal tract penetrate deeply into the cord or synapse on motor 
neurons (230, 261, 262, 316, 337). 

Woolsey and his colleagues have shown repeatedly that both motor cortex and 
somatosensory cortex have both motor and somatosensory characteristics (454). 
This overlap of function seems to have an obvious evolutionary history since, in 
primitive mammals, sensory and motor maps are nearly perfectly overlapped (240, 
242) while in anthropoids they are much more segregated. 

If the converse question is raised—what is the extent of the cortical area of origin 
of the pyramidal (or peduncular) tract?—a similar conclusion is reached. In primi- 
tive mammals, almost the entire neocortex (excepting possibly area 17) must be 
ablated before the fiber degeneration in the peduncles or pyramids is complete, while 
in man the degeneration is almost complete after excision of area 4 alone (204, 206, 
207). Therefore, a definition of motor cortex based on its efferents is little better than 
the one based on its afferents. In most mammals either definition is much too 
inclusive and certainly more inclusive than a cytoarchitectonic definition. 

Because of the dense fiber connections of areas 4 and 6 with the basal ganglia and 
the thalamus (which in turn receives cerebellar imput via the red nucleus), even the 
distinction between pyramidal and extrapyramidal motor systems has begun to blur 
(e.g. 116). Certainly the heavy connections between cortex and basal ganglia permits 
motor cortex to be thought of as the origin of the extrapyramidal system as properly 
as it is the origin of the pyramidal system. 


NEOCORTEX AND BEHAVIOR 289 


Physiology 


Although the mapping of motor cortex in a variety of mammals continues to be 
useful for comparative purposes, the greatest advance in the physiology of motor 
cortex, and the pyramidal tract, specifically, has come from the laboratories of 
Evarts (112-115) and of Porter (185, 187, 322-326). Evarts recorded from single 
neurons in the motor cortex of awake monkeys while they were performing a 
conditioned movement of the wrist. By this means the activity of the cortical 
neurons could be correlated with the overt movements of the animal and the timing 
and coding ofthe neural activity analyzed in behavioral terms. It is satisfying to note 
that this line ofresearch has shown clear and important differences in the pyramidal 
tract cells of motor cortex as opposed to nonpyramidal tract cells and cells in other 
areas (113, 115). Activity in the neurons in motor cortex precede movements (by 
up to 100 msec), while activity in cells in either somatosensory cortex or in the 
frontal eye fields (area 8) follow movements (29, 115). Furthermore, the activity in 
the motor neurons is correlated with the displacement or position of limb or eye 
(113). 

Porter and his colleagues (185, 187, 322, 325, 326) have verified Evarts’ conclu- 
sions and have also described the effect of motor cortex activity on spinal motor 
neurons (185, 324). Still others have come to the same conclusion as Evarts using 
other responses (225, 252) or other techniques (83, 84). Thus, it would seem that 
some neurons in motor cortex do indeed play a special role in the initiation of some 
movements. However, because these characteristics are observed only in neurons 
whose axons traverse the pyramids and we have already noted that anthropoids are 
among the very few mammalian varieties that possess a corticospinal tract in the 
Strict sense, it would not be surprising if such a marked difference between motor 
and sensory cortex does not prevail among mammals in general. 

Particularly interesting for the present purpose, however, are the studies of 
O’Brien & Fox (300, 301) and by Woody and his colleagues (109, 450-453) who 
have investigated neural activity in motor cortex during conditioning. This work has 
suggested that conditioning produces a relatively permanent change in the response 
characteristics of some motor cortex neurons as well as in behavior. By stimulating 
the same neurons directly, it appears that the motor neurons themselves are changed 
as a result of conditioning and are not just reflecting changes that have taken place 
in neurons elsewhere. 


Behavior 


The most recent ablation-behavior experiments on motor cortex or the pyramidal 
system have been confined with three exceptions (13, 204, 337) to cats and monkeys. 
In each case, as it has been in the past, the behavioral observations are broad-ranging 
and almost completely qualitative (e.g. “unnatural posture,” “subnormal tone,” 
etc.). But the one common result of this line of inquiry is the surprising number of 
actions that are not lost to an animal as a result of massive cortical removals. In 
general the degree of effect of a cortical or pyramidal lesion varies from what appears 
to be little or nothing in primitive mammals to sharply defined contralateral paral- 
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ysis in humans (204, 206, 207, 411) with intermediate effects in monkeys (14, 40, 
237, 238, 416). Whether this gradient of effect parallels the differences in the devel- 
opment of the pyramidal tract or whether the loss just seems to be greater in 
anthropoids because they have more to lose to begin with, is far from decided. What 
is clear, however, is that the contribution of motor cortex can no longer be discussed 
for mammals in general—so far, the differences in its role obscure the similarities. 


POSTERIOR ASSOCIATION CORTEX 


Of the posterior cortical areas classified as “association,” those concerned with 
vision in the temporal and occipital lobes have been the most extensively studied 
and have undergone the most significant change in the conceptualization of their 
function; while those located in the parietal lobe have attracted only scant attention 
(54, 55, 58, 59). Three excellent reviews have appeared in recent years, one anatomi- 
cal (459), one functional (276), and one a comprehensive review of all aspects of 
visual association cortex (157). 


Anatomy 


The cortical areas most often included in the visual “association” category are: (2) 
inferotemporal cortex (area 20); (6) prestriate cortex (including areas 18 and 19); 
and (c) a transition zone between these two regions in the occipital-temporal Junc- 
tion (157). Relatively little recent work has been done on the cytoarchitecture of 
these areas in primates; most workers rely on the classical studies by von Bonin & 
Bailey (37), and the older literature. However, several recent studies have been done 
in nonprimates, e.g. cat (174, 175, 353), tree shrew (95), and hedgehog (165, 166, 
214). These studies indicate that it has not been possible to distinguish homologs 
of all of the primate areas on the basis of cytoarchitecture alone: many workers rely 
on similarities in other anatomical features (e.g. efferent and afferents) to make 
cross-species comparisons. It is interesting that some current boundaries of visual 
association cortex in monkeys have been established primarily through ablation 
studies (276) (cf. last section). 

Numerous studies of the afferents to visual association cortex have confirmed the 
existence of cortico-cortical inputs from striate cortex via circumstriate belt or 
prestriate cortex (74, 460, 461). These studies have revealed a characteristic of 
prestriate cortex hitherto unknown: the ventrolateral prestriate area has at least 
three separate foveal projections (74, 458). The only other major cortico-cortical 
inputs to this area yet known are from the contralateral inferotemporal cortex and 
the ventral potion of dorsolateral frontal granular cortex (157). 

The subcortical inputs to inferotemporal cortex consist of a major input from the 
lateral pulvinar nucleus of the thalamus (72, 157, 159, 339) while the subcortical 
input to prestriate cortex derives primarily from the inferior pulvinar (6, 157, 200, 
201). Studies of visual association cortex in the cat describe essentially the same 
pattern of inputs with the major exception that prestriate cortex also receives an 
input from DGL (44, 124, 137, 290, 291, 293). 
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Efferents from these prestriate areas have also been described (174, 366), and, on 
the basis of these efferents, a cortical region in the cat anatomically comparable to 
inferotemporal cortex in the monkey has been discovered (353). Efferents from 
inferotemporal cortex project to circumstriate belt cortex, the frontal lobe, basal 
ganglia, dorsal medial, and pulvinar nuclei of the thalamus, and the superior col- 
liculus (157, 375). Prestriate cortex sends efferents to contralateral prestriate cortex 
(275), pulvinar, inferotemporal cortex (276), and in the cat, to the lateral pontine 
nucleus (41, 138). 


Physiology 


Because of its anatomical distance from visual receptors, it has been very difficult 
to study visual association cortex with electrophysiological methods (282), but 
topographic maps have been made (4, 5). Several evoked potential studies have 
attempted to relate electrical changes to behavioral changes, e.g. during visual 
discriminations (127, 146), or under various arousal states (157), etc. The recorded 
potentials are extremely labile and, thus far, have not been found to be consistently 
related to any one particular stage of visual learning (157). 

Single cell recording has been more successful. Recordings have been made in 
areas 18, 19, and the Bishop-Clare area (304, 406, 457), and in the middle suprasyl- 
vian gyrus in the cat (101, 102, 407, 408). In general, the receptive fields of cells in 
these areas appear to have more complex properties than ceils in striate cortex. This 
finding has been interpreted by some as representing a still higher stage in the 
theoretical hierarchy of receptive fields as proposed by Hubel & Wiesel, but the 
usefulness of this idea is yet to be demonstrated. Similar recordings have been made 
in area 18 of the monkey, but unlike the cat, cells sensitive to retinal disparity are 
found only in area 18 (191). 

Recordings made in inferotemporal cortex of monkeys (15, 104, 158, 160, 161) 
have demonstrated the existence of extremely complex and surprisingly specific 
receptive fields. So specific are these receptive fields that Gross et al (160) were never 
certain that they had found the unique properties of these cells. Another unusual 
feature of inferotemporal cells is the large size of the receptive fields and the 
invariable inclusion of the fovea. Bender et al (15) have also found that cells in 
inferotemporal cortex depend heavily on the striate-prestriate pathway for their 
activation. These results have been taken as evidence for still further hierarchical 
elaboration of receptive fields as suggested by the Hubel & Wiesel model. In addi- 
tion, these results have clearly established a role for the visual “association” areas 
much more closely related to sensory function than had been earlier imagined. 


Behavior 


Ever since the classic studies of Kluver and Bucy on the effects of bilateral removal 
of temporal lobes in monkeys, it has been known that visual information is somehow 
processed there. Later studies showed that it was inferotemporal cortex (rather than 
structures lying under it) that produced the visual deficits (e.g. 201, 275). In general, 
these deficits have been described as deficits in the retention and learning of complex 
visual discriminations (47, 48, 50, 51, 111, 328, 338, 443-445, 447). No deficits have 
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been found on any fundamental visual dimension, e.g. absolute threshold, visual 
acuity, flicker thresholds, nor in visuo-motor behavior following inferotemporal 
cortex ablations (157, 360). 

The question of whether inferotemporal deficits can also be produced by ablation 
of a major input area (prestriate cortex) has only recently been resolved (275). 
Whereas earlier papers had reported little or no change in visual discrimination 
performance following prestriate lesions (e.g. 328, 459), recent studies in monkeys 
in which the newly discovered ventrolateral prestriate areas (foveal prestriate areas) 
were ablated produced dramatic deficits comparable to, if not more severe than, 
inferotemporal lesions themselves (72, 159, 219, 257, 258). 

In contrast, destruction of the pulvinar, another major input to this cortical 
region, does not produce a significant behavioral deficit (276). 

Even though the deficits resulting from inferotemporal ablation are purely visual, 
there are several conditions which vitiate the defect. The defect is reduced (a) if the 
lesions are made in young animals; (5) if the animals are preoperatively overtrained 
(157); and (c) if shock is used to punish errors (256). 

Similar experiments in cats have not been performed partly because the locus of 
a homologous or at least analogous cortical zone is unknown. Nevertheless, in the 
few experimental animals with ablated extrastriate areas, only mild deficits have 
been found (e.g. 170). Ablations of area 20 in the cat have yet to be done. 


FRONTAL CORTEX 


Despite 40 years of careful and ingenious experimentation on the role of prefrontal 
or granular frontal cortex, this area of the brain continues to be one of the most 
puzzling. The starting point of most recent research can be found in the comprehen- 
sive volume edited by Warren & Akert (420) published in 1964, and contributions 
to a 1971 symposium in Poland on the same subject are beginning to appear in Acta 
Neurobiologiae Experimentalis. But because of the large amount and diversity of 
research activity in this area, many papers and certainly most reviews are out of date 
by the time they are finally published. 


Anatomy 


Great strides toward the description of the myriad connections of prefrontal cortex 
have been made with the Nauta-Gygax and Fink-Heimer silver stains. Where once 
the afferents to prefrontal cortex were thought to be adequately described after 
mention of the projections from the medial dorsal nucleus (MD) of the thalamus 
and a general reference to cortico-cortical projections from temporal and parietal 
lobes, it is now known that most of the subareas of prefrontal cortex (areas 9, 10, 
11, 12, etc) have a unique combination of afferents and efferents, most of which are 
order-preserving (24-26, 98, 212, 218, 244, 283, 307, 308, 310, 315, 370, 410, 414). 
Thus, the idea that prefrontal cortex is a large field relatively homogeneous in 
cytoarchitecture or afferent and efferent connections can no longer be retained (346). 

Of the many newly discovered afferents perhaps the most surprising are those that 
project to orbital prefrontal cortex (via MD) from olfactory cortex (108, 283, 327, 
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361) or from cells near the gustatory nucleus solitarius of the hindbrain (63). The 
presence of these fibers means that a part of the prefrontal cortex 1s much closer to 
the nose ( in terms of the number of intervening synapses) and, possibly, to the taste 
buds, than was ever suspected before. Perhaps even more startling, prefrontal cortex 
is closer to the nose than auditory cortex is to the ear. Although no one has yet 
suggested that orbital frontal cortex be called olfactory sensory neocortex, these 
facts certainly suggest that prefrontal cortex as a whole may fall somewhat short 
of being the human “organ of civilization.” 

On the efferent side, similarly important discoveries have been made. Again 
depending on the subarea of origin within prefrontal cortex, studies have demon- 
strated heavy projections to the striatum, to cingulate cortex, and to as far as the 
tegmentum in the medulla, in addition to pathways reciprocating the heavy afferent 
pathways from the thalamus and the parietal and temporal lobes (e.g. 41, 98, 211, 
212, 217, 260, 283, 289, 307, 310, 327, 361, 410, 414). Thus, prefrontal cortex 
discharges directly into the extrapyramidal system (suggesting a somatic motor role) 
and into Papez’ circuit (suggesting a visceral motor role). When this array of outputs 
is considered along with the olfactory and gustatory input and the temporal and 
parietal lobe input, it is difficult to discover a single term that might properly convey 
the range of possible functions of this area. Certainly, the term “association” does 
not help. 


Physiology 


Because it was long thought that prefrontal cortex was synaptically remote from 
either Sensory or motor neurons (and therefore electrically “‘silent”), few of the usual 
electrophysiological recording or stimulation experiments have been done for pur- 
poses other than anatomy (103). Instead, electrophysiological and behavioral tech- 
niques have been combined so that either neural activity is recorded while an animal 
is performing one or another task known to depend on prefrontal cortex (224, 295), 
or electrical stimulation is used to disrupt or facilitate the same behavioral activities 
(348, 384, 386). It is encouraging to note that some single units have been found 
whose activity is correlated with various stages of performance of prefrontal tasks 
and that stimulation certainly disrupts and, in some cases, possibly facilitates the 
same tasks. 


Behavior 


A relatively large number of essentially behavioral experiments on prefrontal cortex 
continue to appear each year. With a few exceptions (e.g. 358) they are derived more 
or less directly from the deficit in spatial delayed response known to result from 
prefrontal damage (202). In general, these experiments follow at least one of three 
directions: either they attempt to localize the subsystem within the prefrontal system 
(MD-prefrontal-caudate) whose damage is responsible for the classic delayed- 
response deficit, or they attempt to find other tasks that are disrupted, or they 
attempt to catalog the similarities and differences in prefrontal function between 
primates and nonprimates. 

Perhaps the most fundamental functional subdivision within prefrontal cortex 
appears to be between the dorsolateral convexity on one hand and the orbital or 
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ventrolateral convexity on the other. Recent studies converge on the idea that 
delayed-response tasks are more sensitive to dorsolateral lesions than to orbital 
lesions, while response reversal and other diverse tasks are more sensitive to orbital 
lesions than to dorsolateral lesions (cf. 46, 49, 52, 141, 145, 156, 172, 199, 254, 274, 
387, 388). In addition, monkeys with orbital lesions suffer deficits in social behavior 
not suffered by monkeys with dorsolateral lesions (38, 53, 85, 373, 396). There is 
some evidence that the orbital region can be further subdivided into lateral vs 
posteromedial with lesions in the former resulting in the response reversal deficit and 
lesions in the latter resulting in the deficit in social behavior. 

Similarly, spatial delayed-response seems to be most sensitive to lesions in the 
vicinity of sulcus principalis and within this region possibly to lesions surrounding 
the middle third of the sulcus (e.g. 56, 57, 385). Furthermore, both the spatial and 
delay factors, in combination, must be present in the task before a deficit can be seen 
(145). 

This dorsolateral vs orbital subdivision of anthropoid prefrontal cortex may also 
help to explain the differences between primate and nonprimate deficits after pre- 
frontal lesions. Tissue homologous to prefrontal cortex (i.e. projection area of the 
thalamic nucleus, or medialis dorsalis), has begun to be located in nonprimates, 
usually on the ventral and medial surfaces of the frontal pole instead of the lateral 
surface (49, 244). In terms of connections and MD cytoarchitecture, nonprimate 
prefrontal cortex seems to be more like the orbital frontal cortex and less like the 
dorsolateral cortex of primates (3, 283), In parallel, the symptoms resulting from 
lesions in nonprimates may be more like those resulting from orbital lesions and less 
like those resulting from dorsolateral lesions in primates (60, 97, 99, 420, 421, 
442). 

The possibility that nonprimate prefrontal cortex is homologous to only part of 
primate frontal cortex seems to be upheld by ontogenetic studies (139, 140, 142-144, 
156, 171, 172, 400, 424), which have begun to describe the limits of plasticity in this 
region, and by phylogenetic (269) studies. The expansion of prefrontal cortex in the 
primate line began only after the primate lineage separated from the other mam- 
malian lineages and a large part of its total expansion seems to have been due to 
the appearance of what is now identified as the parvocellular MD-dorsolateral 
division. If this proves to be the case, it follows that dorsolateral cortex almost 
certainly does not play the same role in nonprimates as it does in primates and might 
well be unique to primates; more consistency among mammals, primate or not, 
might be found in the anatomy and functions of the orbital division. 


SUMMARY 


In the context of this review we have attempted to illustrate that the three-way 
division of cerebral neocortex into areas labelled “sensory,” “motor,” or “associa- 
tion” has lost most of its original usefulness. Almost every report cited here either 
demonstrates, suggests, or refers to, nonsensory functions in sensory areas, nonmo- 
tor functions in motor areas, or nonassociative functions in association areas; and 
some reports refer to this issue explicitly (e.g. 401-403). 
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In surveying the reports, one gains the impression that very few workers in the 
field still use the classical terms to imply function; rather, they have been progres- 
sively restricted in their usage such that now they are little more than an anatomical 
shorthand. It can be anticipated that because of their anatomical usefulness the 
terms will continue to be employed at least until research has suggested an alterna- 
tive of comparable generality. Until then, it can only be hoped that these outmoded 
terms will not be taken seriously by those outside the field, nor, especially, by new 
generations of students. 
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INTRODUCTION 


Organization development (OD) has emerged both from the demands of a changing 
environment and from knowledge provided by the evolution of the applied behav- 
ioral sciences. Rapid changes within organizational environments have demanded 
organizational processes and structures which are far more flexible and responsive 
than traditional bureaucratic structures (22, 112). There have also been dramatic 
shifts in the social environment—-in the life styles, needs, and values of the human 
working force. Increase in professionalization, educational level, and ease of mobil- 
ity have decreased organizational loyalty and dependency. There have been efforts 
to challenge authority, a gradual shift from organizational relevance to personal 
relevance (66), and a trend toward preference for collaborative rather than hierar- 
chical roles (131). Much of the alienation and disenchantment in our current society 
is attributed to the institution of work (139). Work is seen as an arena in which 
considerable leverage can be exerted to improve the quality of life. 

Concurrent with and in partial response to these changes in the environment, a 
number of developments have occurred in the behavioral science disciplines. These 
include the realization that a variety of social and psychological factors affect work 
performance (127); the discovery that group decision-making affects personal in- 
volvement, motivation, and commitment (118); findings that participation increases 
ownership of change (56); and the use of groups as agents of change in organizations 
(48). T-groups and sensitivity training began in the late 1940s as an educational 
method which focused upon group processes and group dynamics (37). Approaches 
to sociotechnical systems (162) and work motivation (98, 99) have more recently 
incorporated the organizational tasks and social structures as important dimensions 
of organizational change. With a few major exceptions (15, 112, 113), contemporary 
organization theory has not yet contributed heavily to the field of organization 
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development. Its prime concern is in describing, analyzing, and theorizing about the 
status quo of the organization rather than the planned change processes which 
might improve its condition. 

Today OD is emerging as a field of interdisciplinary academic study and as a 
recognized profession. Increasing numbers of practitioners are employed by orga- 
nizations. There is an accrediting agency for OD practitioners? plus several profes- 
sional organizations with OD divisions. Although research and theory have 
seriously lagged behind the practice of OD, a number of original books devoted 
primarily to OD have appeared in the past 5 years: Argyris (15, 16); Golembiewski 
(86); French & Beli (75); Beckhard (20); Bennis (23); Blake & Mouton (27); Law- 
rence & Lorsch (113); Schein (145); Walton (168); Fordyce & Weil (73); Lippitt 
(119). In addition, several edited collections on OD have appeared: Bennis, Benne 
& Chin (24); Hornstein et al (101); Burke (41); Burke & Hornstein (42); Margulies 
& Raia (124); Schmuck & Miles (147). Review chapters and papers are now in 
progress which attempt to review a number of studies and to suggest frameworks 
for understanding organizational change (5, 21, 84, 159). 

The framework we shall use in reviewing OD views organizations as composed 
of people with different sets of values, styles, and skills; technologies with different 
characteristics; and processes and structures which reflect different kinds of relation- 
ships between people or between people and their work (114). Processes and struc- 
tures are the integrating mechanisms for (a) applying human resources to 
technological processes for task accomplishment, and (b) facilitating the utilization 
of technological processes for human fulfillment. From this perspective OD is a 
method for facilitating change and development in people (e.g. styles, values, skills), 
in technology (e.g. greater simplicity, complexity), and in organizational processes 
and structures (e.g. relationships, roles) (80). The objectives of OD generally can be 
classified as those optimizing human and social improvement or as those optimizing 
task accomplishment or more likely as some (often confused) blend of the two. 

Our concept of organization development calls for change in technology and 
structure (technostructural) or change in individuals and their interaction processes 
(human-processual), rather than efforts to change only the people, only the struc- 
ture/process, or only the technology of the organization. The rationale behind this 
decision is twofold. First, the organizational processes and structures are the major 
linkage between the human and technological inputs into the organization; process 
and structure are thus core concepts of the organization as a system. Second, much 
of the literature on change indicates the relative impotence of efforts to change only 
technology (e.g. industrial engineering, operations research, and scientific manage- 
ment approaches), only people (e.g. selection methods, sensitivity groups, and most 
training), or only the structure (e.g. rearrangements and reorganizations based only 
on structural elements). For example, strong resistance or failure is often encoun- 
tered in efforts to change only the organization structure (59), only the technology 
of the organization (163), or only the individual (46, 47, 71, 81). 
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The framework for our discussion of OD is pictured in Figure 1. An organization 
includes people, technology, and process/structure. These components interact in 
the technostructural and human-processual systems on behalf of the objectives of 
human fulfillment and task accomplishment. Although we have not pictured it in 
Figure 1, the environment obviously interacts with all of these components in terms 
of input to and output from the organization. 
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Figure 1 Approaches to organization development 


As indicated in Figure 1, the human-processual and technostructural change 
approaches converge at the interface of the organization process and structure. Both 
process and structure are concerned with authority, communication, decision mak- 
ing, goal setting, and conflict resolution. But process implies the implementation of 
these as dynamic behavioral events and interactions, whereas structure describes 
these as ongoing sets of durable roles and relationships. Attitudes and behavior are 
clearly affected by both process (130) and structure (137). Over time, structures can 
be changed by processes that are inconsistent with them and processes are con- 
strained and facilitated by organizational structures. Since process and structure are 
embedded in each other, it is almost impossible to create lasting change in one 
without modification of the other. Yet there are those who focus on changing 
organizational structures with no involvement in the behavioral processes in which 
these structures are embedded, and those who are totally concerned with changing 
processes, oblivious of the ongoing structures which underlie these processes. 

If organizations can be viewed as composed of three components (people, technol- 
ogies, and structures), so also can OD itself be understood by this model. It is at 
once a Set of personal values, a set of change technologies, and a set of processes 
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or structures through which the change agent relates to the organizational system. 
Thus OD interventions can be described in terms of the degree to which they 
incorporate values of humanism, democracy, economics, and science; the degree to 
which they utilize such technologies as team building, job design, and survey feed- 
back; the degree to which they incorporate collaborative or unilateral relationships 
between change agent and organization; and the degree to which they are intended 
to optimize human/social benefit or productivity/performance objectives. 

The values of the more humanistic and democratic approaches to OD are stated 
rather explicitly in the literature. Many of these stem from the Theory Y assump- 
tions of McGregor (128). Tannenbaum & Davis (160), for example, clearly state 
these as values in transition in society: toward a positive and confirming view of man 
as a whole person, in process; toward encouraging the effective expression of feel- 
ings, authenticity, risk, confrontation, and collaboration. These values are similar 
to the humanistic-democratic values expressed by others (22, 86, 124). 

The objectives of OD are obviously colored by these values. They include creating 
an open problem-solving climate, supplementing the authority of role and status 
with the authority of knowledge and competence, locating decision-making and 
problem-solving as close to information sources as possible, building trust and 
collaboration, developing a reward system which recognizes the organizational 
mission and the growth of people, helping managers to manage according to relevant 
objectives rather than past practices, and increasing self-control and self-direction 
for people within the organization (42, 155). Others see OD objectives as increased 
problem-solving capacity (15, 155), cultural change (42), or re-education (86, 174). 

The relationships of OD practitioners to organizational systems vary from 
facılitative-collaborative to unilateral-directive. In the latter, the change agent gath- 
ers data, analyzes it, reports his findings to senior management, and recommends 
changes. In the more facilitative role, the change agent helps others engage in 
exploration, diagnosis, and development. Phases in the facilitative-collaborative 
process include helping the organization identify problems, set priorities, develop 
data, plan action strategy, test and implement alternatives, and evaluate the change 
effort (20, 109, 120, 124). 

The main body of this review is divided into these two sections: technostructural 
and human-processual change approaches. We exclude studies and conceptualiza- 
tions not encompassed in one of these two approaches. We omit studies which were 
not concerned with change per se, or which were not field studies in organizational 
settings. We have also not considered some approaches because of space limitations 
or inclusion in previous reviews: e.g. management by objectives, pay and incentive 
systems, staffing systems. 


RESEARCH ISSUES IN ORGANIZATION DEVELOPMENT 


As in most of the applied behavioral sciences, research and theory on OD follow 
practice. This may reflect either the difficulties of doing research on OD or the 
impotence of research in creating practical theory and contributing to improved 
practice. The first explanation locates responsibility in organizational systems for 


ORGANIZATION DEVELOPMENT 317 


lack of cooperation and appreciation of research; the second locates responsibility 
in research for lack of relevance and applicability to the organization. 

Research on organizations, even stable organizations, presents a host of difficul- 
ties. Organizations are complex systems, with many relevant variables that combine 
in intricate interactions. The student of organizations must cope with size, com- 
plexity, and sluggish responses to events. Controlled comparisons are difficult, and 
the costs of developing a large sample of organizations can be astronomical. Behav- 
ioral science is just beginning to grapple with the social psychology of research, and 
organizational researchers must cope with issues like the attitude toward the re- 
search of the respondent’s boss (2) and the obvious need by respondents to benefit 
from the research (78). 

On the other hand, issues of complexity, reactivity to research, and extraneous 
variation to research interventions reflect human and social realities. If reactivity, 
for example, is considered a “research finding,” it has as much validity as those the 
researcher expected. 

Much recent energy has been directed toward issues of reactivity, complexity, and 
extraneous variation. In one approach, the researcher attempts to avoid making his 
presence known in the research situation through unobtrusive, experimental, or 
statistical designs. The assumptions behind these methods are similar to those in the 
physical sciences—that reactivity and complexity can be successfully reduced or 
eliminated by reducing or eliminating the intrusion of the researcher. “Objectivity” 
can thereby be increased. A second approach takes the opposite stance. It attempts 
to incorporate the researcher and his data-gathering methods into the OD situation. 
The assumption is not only that the researcher-client relationship cannot be avoided, 
but that it sets a climate and structure which has clear repercussions on the research 
outcomes. This approach focuses upon the researcher-client relationship as one to 
be openly explored and implemented rather than avoided. 

Webb et al (171) have developed a host of “unobtrusive measures” that are 
collected without respondent awareness and so “uncontaminated” by his reactions 
(though their use raises some thorny ethical problems that have pragmatic conse- 
quences for the consultant). One of the risks here is that the basic value system 
implied in unobtrusiveness contradicts the values of openness and involvement 
underlying OD. Planned change interventions are necessarily active and influencing, 
not unobtrusive. A second risk is that the choice of variables which can be unobtru- 
sively collected may well preclude those which maximize relevance to under- 
standing the OD process. 

A second approach to the issue of extraneous variation and human reactivity is 
to utilize experimental design. D. T. Campbell & Stanley (45) describe “‘quasi- 
experimental” designs that allow relatively unambiguous results from field investi- 
gations. D. T. Campbell (43) has applied such designs to conceptualize “reforms as 
experiments,” and has shown that it is possible to develop relatively undistorted 
estimates of the impact of social reform. 

The concept of “convergence” also offers a third hope for dealing with the chaotic 
complexity in OD research. Convergence of flawed research designs can lead to 
relatively unambiguous conclusions when the flaws cancel each other (45). Conver- 
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gence of methodologies for measuring a single variable lead to more reliable mea- 
sures (44). Convergence of diagnostic interpretations by feedback and discussion of 
preliminary results can lead to elaborated and validated understandings of a specific 
situation (97). Convergence of different information sources on events (e.g. consult- 
ant, different groups in the organization, outsiders) can lead to increased validity 
of perceptions. Temporal convergence can be obtained by examining patterns over 
time. All these approaches lead to clearer specification of the important phenomena 
in the complex interweaving of organizational events. 

A fourth strategy is to develop data collection methods that are clearly relevant 
to organizational problem solving and so have high “face validity” to respondents: 
Blake et al’s (29) intergroup workshop, Beckhard’s (19) “confrontation meeting,” 
Jenks’ (107) “organizational Q-sort,” Argyris’ (13) content analysis of tape record- 
ings of management decision making, and Alderfer & Brown’s (6) “empathic ques- 
tionnaire” are all methodologies that rely in part for their effectiveness on their 
obvious relevance to respondents’ organizational lives. 

A fifth approach is to improve the relationship between researcher/consultant 
and respondent. Argyris (14, 15) has suggested that the quality of information 
obtained is a function of the relationships between researcher and respondent, and 
there is some evidence to support this argument. Alderfer (1) found that relation- 
ships established in interviews prior to answering a questionnaire led to franker 
questionnaire responses; Brown (38) found evidence that relationships between 
researcher/consultants and clients were improved by interaction in feedback meet- 
ings; Clark (52) noted that failure to strengthen and broaden organizational “sanc- 
tion” of research led to research termination. 

OD research can produce a range of outcomes, not all of them expected. One 
obvious product is evaluation of various types of OD intervention (e.g. Marrow, 
Bowers & Seashore 126; Bowers 35). Another expected outcome is the generation 
of new theories grounded in organizational experience (e.g. Jaques 106). OD re- 
search may also serve to guide further activities: Alderfer’s (2) study of division 
manager reactions to the researcher/consultant had implications for further work 
with the organization; survey feedback can serve as a tool for organizational! devel- 
opment (123) or as a means of increasing understanding (97); Culbert (58) has 
reported an intriguing use of research to persuade a client that his preferred inter- 
ventions were not going to solve the problem. OD research may also be an interven- 
tion in and of itself: Sanford (144) describes change in students and faculty in 
response to interviews; Hunt & Lichtman (104) report a study in which pretest 
effects outweighed the effects of training—a disturbing result for both trainers and 
pretesters. All four functions—intervention evaluation, theory generation, activity 
guide, intervention per se—are highly relevant to both researchers and consultants. 

Current research on OD is much like action research of the 1940s and 1950s (50, 
75). Like action research, OD research has tensions built into it that are likely to 
lead either to “action” (OD consulting) or to “research” (development of organiza- 
tion theory), but not to both. 

The concept of action research as originally advanced had two sets of goals built 
into it (117): both “the study of general laws” and the “diagnosis of a specific 
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situation.” But Rapoport (138) suggests that action researchers have come to subor- 
dinate research goals to action goals. Note, for example, the lack of mention of any 
development of new theories of change in French & Bell’s definition of action 
research for OD: “... action research is the application of the scientific method of 
fact finding and experimentation to practical problems requiring action solutions 
and involving the collaboration of scientists, practitioners, and laymen” (75, p. 87). 

The divergence between research-theory perspectives and action-change perspec- 
tives is sometimes attributed to contradictory postures vis-a-vis organizational prob- 
lems: the researcher’s descriptive stance, required for understanding the problem, 
contradicts the consultant’s prescriptive stance, required for developing problem 
solutions. Alternating hats requires much self-awareness and strength of purpose 
(see Argyris 15). Furthermore, researcher/consultants serve two client systems 
whose requirements and criteria for evaluation are at least superficially incompati- 
ble: consumers of research want “general laws” and sneer at “applied research,” and 
consumers of consultants want “specific diagnoses” and scoff at “abstract theoriz- 
ing.” Reputation with one client system may be purchased at the cost of reputation 
with the other. 

Although these pressures create tensions for researcher/consultants, the separa- 
tion of roles is not inevitable, or even desirable. Developing teams of specialists is 
one response to the problem (e.g. Blake et al 28, Miles et al 130). But in the long 
run, greater integration of the two perspectives within a single theorist-researcher- 
practitioner person is necessary. 

The processes which researchers and consultants utilize seem similar in a number 
of ways. Consultants tend to build a data base from their previous experiences, 
develop theory of practice inductively, apply this practice-theory, note the repercus- 
sions, and revise their practice-theory accordingly. Researchers utilize a similar 
method. The researcher’s inductive process draws more upon external data (previ- 
ous research studies and literature) rather than previous personal experience; he 
tends to test his theory deductively in a more external sense through instruments 
and observational methods rather than through direct experience and involvement. 
But there is much that each can learn from the other. The consultant may need help 
in building a systematic method for learning and practice, and researchers might 
learn much from investigating the action implications and impact of their findings. 
If science is knowledge-getting (as its Latin derivation indicates), both researchers 
and practitioners can use as much of it as they can get. There is evidence that more 
OD research is being done and that interest in trying to combine action and research 
is on the rise (144). Ultimately, the field needs a metatheory of research and practice 
that clarifies and integrates the choices confronting the researcher/consultant. Some 
of the groundwork for that theory has already been laid. 

In one sense the very dilemmas that OD research faces are its opportunities to 
create a new approach to knowledge-getting and knowledge-giving. It faces these 
dilemmas because, unlike many disciplines, it is a humanistic, change-oriented, 
professionally practiced field. Its current research methods are based on traditions 
that knowledge-getting and knowledge-giving are separate; that evaluation and 
validation of change lead to understanding development processes; that external 
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rigor, control, and standardization are ideal methodological characteristics; that 
knowledge necessarily produces change. 

Most research reported in this review, for example, has taken an evaluative stance, 
utilizing reliable pre- and post-measures selected and administered by the researcher 
to increase his knowledge of change and development in the “other” (e.g. the person, 
group, Organization, etc). Pre-post measures provide for the practitioner and the 
client systems data at only two points in time, rather than a continuous picture of 
the interactive developmental process. Continuous documentation (e.g. a detailed 
case study) provides data on the ongoing interactive processes which occurred, but 
does not provide immediate feedback to guide subsequent action steps. The issue is 
somewhat similar to suggesting to a person about to shave his face to look at 
yesterday’s photographs of his preshave and postshave face instead of looking in the 
mirror. A moving picture would provide continuous data, but still not the immedi- 
ate feedback or client control that shaving in a mirror provides. Thus for consultant 
and client relevance, research is most useful if it provides immediate continuous 
feedback of the process and progress of intervention and development. OD theory 
based on this kind of research would provide the greatest validity on change pro- 
cesses and might also provide the most relevance to practitioners. 

This calls for collaboration between researcher and organization members well 
beyond current unilateral research relationships (77a). It calls for more open com- 
munication on the conflicting interests between the two parties as a mutual problem 
to be solved collaboratively, a recognition of the legitimacy of each other’s interests, 
and a search for solutions which are responsive to both sets of needs. 


TECHNOSTRUCTURAL APPROACHES 


Technostructural approaches to OD refer to theories of and interventions into the 
technology (e.g. task methods and processes) and the structure (e.g. the relation- 
ships, roles, arrangements) of the organization. Technostructural approaches are 
rooted in the fields of engineering, sociology, psychology, economics, and open 
systems theory. Change interventions are intended to affect the work content and 
method and to affect the sets of relationships among workers. Within the broad 
heading of technostructural development are included sociotechnical systems per- 
spectives, job design and enlargement, and job enrichment. These methods are easier 
to separate in theory than in practice, but the conceptual distinction among them 
has implications that are important enough to warrant their preservation in this 
discussion. 

Sociotechnical systems and job design are, in part, a reaction to and an emergence 
from two earlier and still current perspectives of change: (a) scientific management 
and industrial engineering which focus upon the physical environment and physio- 
logical requirements of workers; and (b) psychology and social psychology, which 
focus on the social relationships and personal needs of the workers. Proponents of 
sociotechnical systems and job design criticize the physical approach for treating 
social groups and individuals mechanistically, and criticize the psychological ap- 
proach for ignoring the technology of the organization or treating it as unchangea- 
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ble. Further criticism is that both approaches treat the organization as a closed 
system, dealing with intraorganizational issues in piecemeal fashion and ignoring 
the important linkages between the organization and its environment (60). 

The concept of sociotechnical systems has been applied to the work initiated in 
England, particularly at the Tavistock Institute. Sociotechnical interventions are 
directed at the fit between the technological configurations and the social structure 
of work units. The term “job design” until recently has been applied to develop- 
ments in the U.S., particularly those initiated at the University of California at 
Berkeley. Job design studies have sought to manipulate the configuration of technol- 
ogy, as in task designs and assignments making up jobs, and to explore the effects 
of these changes on personal, social, and organizational phenomena (60). We have 
grouped job design studies together with job enlargement studies since the method- 
ologies are similar. Job design and enlargement are concerned with increasing 
satisfaction and performance, although there seems to be greater concern in job 
design to enhance performance, and greater concern in job enlargement to enhance 
job satisfaction. Job enrichment is derived from theories of motivation, whereas job 
enlargement is based on industrial engineering principles. In job enrichment, work 
functions from a vertical slice of the unit are brought together into a single job to 
increase the challenge of that job (and therefore the motivation of the jobholder); 
in job enlargement, work functions from horizontal slices of the work unit are 
consolidated to provide greater variety and a sense of the whole task. 

There are also differences in scope among these three approaches. Sociotechnical 
systems take into account the entire system, job design focuses upon sets of inter- 
related functions, and job enrichment is concerned primarily with single jobs. Suc- 
cessful job enrichment, for example, may enrich a single job at the expense of other 
jobs in that unit. Sociotechnical systems, on the other hand, frequently rearrange 
relationships among roles or tasks or the sequence of activities. 


Sociotechnical Systems 


In one of the earliest studies in sociotechnical systems, Trist & Bamforth (163) found 
that two different social systems in British coal mining were associated with very 
different levels of productivity, absenteeism, and accidents. Traditional coal mining 
utilized small, cohesive work groups working as autonomous teams, each establish- 
ing its own system of work with each worker performing a variety of tasks. Techno- 
logical advances Jed to a change in which work groups of about 40 men in three 
shifts were formed, with their activities spread over a wide area. The task require- 
ments prohibited close interpersonal relations and group identification; workers 
were assigned to narrow tasks and paid on an individual rather than a team basis. 
The longwall method resulted in lowered performance, higher absenteeism, feelings 
of passivity and indifference and emotional strain. When many of the social and 
group relationships were reintroduced into a “composite longwall method,” produc- 
tivity, attendance, safety, and morale were substantially improved (70, 164). 
Rice (140), another member of the Tavistock group, applied similar concepts in 
an Indian weaving mill. Installation of automatic looms and task specialization led 
to a decline in the quantity and quality of woven products. Rice introduced a 
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number of changes: the number of job levels was reduced from nine to three, tasks 
were made interchangeable, and semiautonomous work groups were created and 
placed in charge of a set of looms. During the following 2 years, productivity rose, 
wasted-cloth rate declined, and morale rose dramatically. 

More recently, Tavistock group members have worked with the employee rela- 
tions group of Shell UK Limited to develop and diffuse throughout the organization 
a philosophy of management that explicitly emphasizes the maintenance and devel- 
opment of human resources as well as productivity as corporate goals (100). Origi- 
nally conceived as the initial step in a company-wide effort to decrease worker 
alienation and apathy, the diffusion of the new philosophy and the subsequent 
changes in procedures and productivity bargains presage major changes in the 
quality of life and levels of productivity in the organization. 

On a larger scale, the Tavistock group is now heavily involved in experiments in 
“industrial democracy” being carried out at the national level in Norway. In col- 
laboration with workers, employers, and the Norwegian government, a number of 
field experiments in the effects of different work philosophies and sociotechnical 
arrangements have been carried out. Though the results available are only prelimi- 
nary, there is evidence (70a, 160a, 162a) that the changes do indeed have important 
effects on both productivity and alienation. 


Job Design and Job Enlargement 


Like the sociotechnical systems approach, job design and enlargement focus upon 
both the technical aspects of the job and the social structures which support it. They 
also attempt to increase satisfaction and performance by building in greater variety, 
discretion, feedback, identity, and responsibility for whole task completion. 

In one of the earliest studies in job design, Marks (125), as reported in Davis & 
Canter (61), conducted an experiment on the shop floor to change the configuration 
of technology through task design and job assignments. For example, operations, 
inspection, and securing of materials were combined into one job and performed at 
individual work stations. This design resulted in improvements in output, quality, 
production flexibility, and job attitude. In a similar study (57) in which assembly 
jobs were changed over to more independent bench jobs, production time decreased 
and quality increased. Preference for self-pacing was found as the reason for prefer- 
ring bench jobs, but social interaction and work interaction showed sharp reduc- 
tions. Similar results have been found for enlargement of maintenance jobs (63). Job 
enlargement for supervisors has resulted in a greater orientation to the technical 
problems of production and to worker training. Although no changes were observed 
in production, costs, absenteeism, grievances, or transfers, there were significant 
increases in quality (62). In a summary of a number of job enlargement projects, 
Stewart (158) found that quality of product improved, labor costs decreased, opera- 
tors generally preferred job enlargement in a short time, and problems inherent in 
paced groups largely disappeared. 

Alderfer (3) concludes in a review that job enlargement results in increased overall 
job satisfaction and greater meaningfulness of work. Lawler (111) reviewed ten 
studies and found quality improvements in all, but productivity increases in only 
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four. Davis (60) explains such improvements as a consequence of increased number 
and variety of tasks, self-determination of pacing, increased responsibility for qual- 
ity, increased discretion of work methods, and sense of work completion. Alderfer 
(3) found that enlarged jobs are judged by non-jobholder experts as requiring more 
technical competence and opportunity for innovation, more variety of duties, and 
more need to deal with people. Incumbents of enlarged Jobs reported increased 
satisfaction with opportunities to use their skills and abilities and decreased satisfac- 
tion with their relations with superiors. Friedman (82) reports a study in which 
interaction.and communication between foremen and workers increased, resulting 
in increased prestige for workers. However, at least two studies report negative 
effects of job enlargement on interpersonal work relationships. Conant & Kilbridge 
(57) note that social interaction opportunities and work interaction showed sharp 
reductions in bench work. Alderfer (3) reports that job enlargement resulted in 
lower satisfaction with respect from superiors. 

Alderfer (3) suggests that differing results of job enlargement on interpersonal 
relations is contingent upon the technology. In continuous process technology, 
where there is a high demand for interdependency among employees, job enlarge- 
ment may place greater stress on these interdependencies, resulting in lowered 
satisfaction. 


Job Enrichment 


Job enrichment is the restructuring of the content of a set of jobs or functions which 
are vertically related to enhance the employee’s opportunity for responsibility, 
achievement, challenge, and growth. It generally involves collapsing various vertical 
functions into a single more responsible function. Proponents of job enrichment are 
somewhat critical of job enlargement, claiming that job enlargement condenses the 
boredom and monotony of several jobs into one job. Job enrichment as an organiza- 
tional process evolved from the two-factor motivation theory developed by Herz- 
berg, Mausner & Snyderman (99); the theory stresses that man’s needs at work are 
met essentially by the nature of his work. The major motivators that can be built 
into work are increased achievement, recognition, responsibility, advancement, task 
capability, and knowledge. Herzberg’s research and theory (98, 99) have focused 
upon individual motivation rather than upon the implementation of organizational 
change processes which might increase motivation. It is essentially a theory of 
motivation rather than a theory of change. 

The most widely known study in the implementation of job enrichment was done 
by Ford (72) at American Telephone. Underlying these efforts has been the assump- 
tion that increased motivation cannot result from training or other attempts to instill 
in workers greater energy and involvement. Rather, all one can do is provide the 
employee a task which offers greater challenge; he will then become intrinsically 
motivated. The initial target of Change is job content, not the employee. Ford (72) 
studied a number of job enrichment programs at American Telephone in a variety 
of departments with different tasks, utilizing several different methods to introduce 
the job enrichment intervention. An effort was made to keep the environment of the 
job constant so that results could be attributed solely to change in the job content. 
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Results indicated that turnover and absenteeism were reduced, quality of service was 
increased, and employee attitudes toward work improved in work units which 
received job enrichment. 

There are a number of reviews of job enrichment applications (10, 122, 143) and 
several additional accounts of application findings (129, 135). Although the number 
of well-designed studies is too few to make broad generalizations, there are indica- 
tions that job enrichment has positive effects on satisfaction and motivation (72). 
There is reason to believe that the quality of production increases more than the 
quantity (10, 111), although this applies to applications which combine job enrich- 
ment and enlargement. In much of the literature, the concept of “motivation” is 
used as a convenient device to explain increased performance or satisfaction. The 
direct effect on motivation of job enrichment programs is unexplored. 

There are a number of questions regarding the relative impact of the process 
through which changes are introduced and implemented and the impact of the 
restructured job. In some studies employees participated in decisions, while in others 
only supervisors influenced the restructuring. The assumption which underlies di- 
rective implementation is that task related experiences lead to changes in attitudes 
and needs (89, 121); changed work experience will result in attitude change, rather 
than vice versa. In connection with this, Hackman & Lawler (92) and Alderfer (3) 
suggest that experiencing higher order need satisfaction will result in the develop- 
ment of higher order needs. 

Job enrichment and job enlargement generally ignore individual and technologi- 
cal differences; most proponents assume that job enrichment or enlargement are 
applicable and relevant regardless of the individual concerned and regardless of the 
technology of the organization. But Turner & Lawrence (165) found that enriched 
jobs resulted in greater satisfaction and less absenteeism for small-town workers but 
not for urban employees. Similarly, Hulin & Blood (103) and Blood & Hulin (33) 
found that workers who hold traditional values about work and achievement re- 
spond positively to complex jobs, but more alienated (urban, blue-collar) workers 
respond negatively to complex jobs. Hackman & Lawler (92) report positive correla- 
tions between job complexity (variety, autonomy, task identity, feedback) and moti- 
vation, satisfaction and performance only for employees with higher order need 
strength (e.g. for accomplishment, growth, challenge, variety, participation). Wa- 
nous (170) found that higher order need strength accounted for more of the relation- 
ships between job complexity and positive outcomes than either the blue-collar, 
urban distinction made by Hulin & Blood (103) or the Protestant ethic values 
distinction made by Blood (32). Standing (156) found that the degree of satisfaction 
within a single set of jobs varied as a function of the cognitive complexity of the 
employee. The degree of satisfaction with the work itself was an inverted U-shaped 
function of cognitive complexity. These studies suggest a contingency theory in 
which the probability of job enrichment and enlargement resulting in increased 
performance or satisfaction is contingent upon the needs of the employee, his 
cognitive complexity, and his cultural milieu. 

Similarly, increases in performance and satisfaction resulting from these changes 
may be contingent upon the technology of the organization. Task identity, for 
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example, may be difficult to establish in a continuous process technology, but 
autonomy and feedback seem easier to implement in an automated process technol- 
ogy than in heavy assembly work (31). Several organizations are finding job enrich- 
ment and enlargement simply impossible given their huge investments in stationary 
equipment. Other contingencies which the organization may face include lack of 
sufficient skills to take on more demanding work, short-term lowered performance 
which frequently accompanies organizational change, union resistance or disinterest 
(143). 

The sociotechnical studies reviewed in this section consistently indicate increased 
performance and productivity—a finding that is less clear for job design/enlarge- 
ment or job enrichment. The evidence on job enrichment is too sparse as yet to make 
unqualified claims as to its effectiveness; more research is needed. All three ap- 
proaches seem to result in quality increases, lowered absenteeism, and lowered 
turnover. The evidence for production time decreases and cost savings is mixed for 
job enlargement and job enrichment. In terms of attitudinal change, job satisfaction 
tends to increase as a result of job enlargement and job enrichment. Job enlargement, 
however, tends to result in a more socially isolated role for the workers, which 
increases both the need and difficulty in relating to others with whom their tasks 
are interdependent. Sociotechnical systems avoid this issue by social restructuring 
of the work group, resulting in clear increases in group morale. 

Changes in job design have typically incorporated many simultaneous changes in 
task variety, self-determination and discretion, responsibility, challenge, and whole- 
ness; in the degree to which the task requires hierarchical, collaborative, or autono- 
mous behavior; and in different technologies. Research has generally not attempted 
to isolate these as independent variables, nor has it explored relative changes in 
different kinds of motivation, satisfaction, and performance. We need to explore the 
relative effects of participative and nonparticipative change processes upon the 
implementation and success of job design changes. At this point, much of what we 
can Say is: given these employees, this technology, this change process, and this job 
design structure, this happened in terms of satisfaction and performance. We need 
more specificity of the complex interactions of independent, dependent, moderator, 
and process variables to specify cause, effect, and contingency. 


HUMAN PROCESSUAL APPROACHES 


Human process intervention focuses on the human participants and the organiza- 
tion processes (e.g. communication, problem solving, decision making) through 
which they accomplish their own and the organization’s goals. This orientation to 
OD is rooted in the academic fields of psychology, social psychology, and an- 
thropology and in the applied disciplines of group dynamics and the human rela- 
tions movement. Human process-orientation change agents tend to value human 
fulfillment highly and to expect improved organizational! performance to follow on 
improved human functioning and processes (152). These two don’t necessarily go 
hand in hand, even though it may be more comfortable to assume this. 
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Process-oriented consultants typically have been sensitive to the nature of the 
consultant role (145, 157). A good deal of thought has been given to some of the 
dilemmas experienced by consultants (11, 15), the problems of developing a valid 
organizational diagnosis (15), and the risks of alternative interventions (94). 

Although practitioners have developed a number of alternative interventions 
concerned with human processes in organizations (cf. Beckhard 20, Fordyce & Weil 
73, Burke & Hornstein 42, French & Bell 75), most research efforts focus on three 
general areas of process-oriented intervention: survey feedback, group development, 
and intergroup development. 

These three intervention strategies are founded on some common assumptions: 
sharing information can be valuable, particularly when it hitherto has remained 
unshared but has influenced organizational processes (like some covert feelings); 
confronting and working through differences among people who must work together 
can enhance collaboration; participation in decision making can lead to increased 
commitment. Survey feedback involves organizational groups in discussing diagnos- 
tic data and planning action steps. Group development activities emphasize improv- 
ing group abilities to accomplish their tasks. Intergroup development interventions 
press for improved management of the interfaces between groups. These three forms 
of intervention are at least potentially compatible, and often are used in sequence, 
beginning with survey feedback and moving to group and intergroup development 
activities. 


Survey Feedback 


Survey feedback is a process in which data is systematically collected (usually by 
questionnaires) from members of an organization, analyzed in summary fashion, 
and fed back selectively to organization members. To varying degrees, outside staff 
and organization members collaboratively design the questions to be asked, jointly 
analyze and interpret the data, and feed back in meetings to organization units from 
which the data was collected for purposes of diagnosis and potential change. The 
intervention has developed from traditional attitude surveys which were adminis- 
tered to employees and fed back to management; in recent years findings have been 
presented to employee respondents as well, partly to gain their cooperation in future 
studies. Such feedback has now taken on clear organizational development purposes 
as well (36, 130), on the assumption that discrepancies between organizational ideals 
and actual responses to the survey will generate motivation for change (136). 

Early participation of organization members in the design and collection of data 
is likely to increase the relevance of the feedback (123, 130, 132). Organization 
members may be asked to develop questions for the survey, or they may be inter- 
viewed to determine what issues are relevant (130). Early involvement in one study 
increased awareness of interpersonal problems between supervisors and subordi- 
nates (7). 

The part played by organizational superiors in the feedback meetings is critical. 
Baumgartel (17) found that perception of supervisory behavior changed as a conse- 
quence of increased information flow and problem confrontation between hierar- 
chical levels after feedback. Chase (49) reports that the feedback process tended to 
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equalize power even in a highly threatening environment, and Klein, Kraut & 
Wolfson (108) report that use of line managers rather than personnel representatives 
to feed back data resulted in more satisfaction and greater perceived utilization of 
the data. Participation of superiors in discussion of the feedback may facilitate 
effective use of the information. On the other hand, a resistant or antagonistic 
superior can undermine the process; Alderfer & Ferris (7) suggest that managers 
meet in peer groups to prepare for feedback of potentially threatening results before 
meeting with their subordinates in family groups. 

External consultants can play an important role in the survey feedback process; 
they may design instruments, analyze, present and interpret the data, plan for action 
steps, and analyze the process of the feedback meetings (cf. Miles et al 130). Alderfer 
& Ferriss (7) found that participants reacted more favorably to feedback by outsider- 
insider consultant teams than to insiders alone, and Chase (49) reports better 
problem-solving and meeting outputs with an outside consultant present. 

Processes which occur during the feedback meetings also appear to influence the 
outcomes of those meetings. In the Detroit Edison studies, groups which had more 
intensive feedback sessions (aided by a consultant) showed more positive attitude 
change than groups with less intensive sessions (17). Brown (38) found evidence that 
participants were more receptive to feedback after the meetings, and Klein, Kraut 
& Wolfson (108) found that face-to-face feedback meetings were more effective than 
written reports. Participation in feedback meetings can effect the group’s percep- 
tions of problems (49), its expectations of improvement (7), and the quality of its 
interaction (130). l 

It seems clear that survey feedback meetings can lead to attitudinal changes by 
participants. Miles et al (130) report improved satisfaction with decisions (even 
though they were not implemented); Mann (123) found improvement in attitudes 
toward work, supervisors, progress, and group ability to get the job done; Bowers 
(35) reports improvement in organizational climate, managerial task and interper- 
sonal leadership, and satisfaction; Brown (38) found increased participant involve- 
ment in the organization. 

But the longer term changes in individual behavior or organizational performance 
appear to be contingent on more than just survey feedback. Despite highly successful 
meetings and increased satisfaction, Miles et al (130) found that few action steps or 
structural changes emerged from the meeting and so little change occurred. Brown 
(38) notes that the higher involvement did not persist without follow-up, and 
Frohman (83) found that more change occurred with consultant follow-up than 
without it. 

The research literature suggests that the effectiveness of survey feedback can be 
increased by collaborative involvement of the participants, participation of unit 
management, facilitation by an outside consultant, and specific decisions about 
follow-up and action steps. There is evidence that survey feedback can be an effective 
“bridge” between diagnostic activities (e.g. interviewing or questionnaire adminis- 
tration) and active intervention, since its primary effects seem to be on attitudes and 
perceptions of the situation. But there is little evidence that survey feedback alone 
leads to changes in individual behavior or organizational performance. 
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Group Development Intervention 


OD practitioners have emphasized the importance of the work group rather than 
the individual (40, 102). Group development interventions like “team-building,” 
according to French & Bell, are “probably the most important single group of (OD) 
interventions” (75, p. 112). 

Descriptions of different forms of team-building technology are easily obtained 
(e.g. Clark 53, 54; French & Bell 75; Fordyce & Weil 73; Golembiewski 86; Burke 
& Hornstein 42). Beer (21) classifies the different approaches to group development 
according to the primary issues they treat: (2) goal-setting activities (18, 20) that 
establish clear goals for the team to achieve; (b) interpersonal relations development 
(12, 13, 152) to improve the quality of interaction among team members; (c) role 
analysis work (65, 95) for increased clarity about each member’s role and responsi- 
bilities; and (d) “Managerial Grid” group development (26) to prepare the group 
for later phases in the grid OD program. 

The literature contains a number of case studies of group development activities 
from the vantage point of the consultant (69, 110). But these case studies offer little 
more than the flavor of the experience. 

The impact of team-building activities has been explored systematically by several 
investigators. Argyris (12) and Harrison (93) report interview, observation, and 
questionnaire data from three groups of managers, two of which experienced T- 
group training and a comparison group. The study was frankly exploratory and 
suffers from methodological problems, but the data suggested changes in values and 
behavior of participants occurred. In a subsequent study, Argyris (13) trained an 
executive group to be more “interpersonally competent”; time series analysis of tape 
recordings suggested that the training affected executive meeting behavior, though 
only anecdotal evidence for the relationship between the changed behavior and 
increased effectiveness was available. In a larger study, Friedlander (76) found that 
four trained teams reported significantly higher levels of group effectiveness, mutual 
influence, and personal involvement and participation than did the eight control 
groups. Schmuck, Runkel & Langmeyer (148) trained the faculty of a junior high 
school and compared the experimental faculty with the faculties of several similar 
schools. The experimental faculty reported more positive perceptions of the princi- 
pal, the staff meetings, and the level of innovations in teaching, and they espoused 
more norms consistent with laboratory values than the controls; the authors also 
report anecdotal data about positive changes. 

The critical elements of the team-building process remain only partially explored. 
Friedlander further analyzed the data from his field experiment (76) and found that 
the impact of the team building varied greatly across the four participant groups 
(77). Comparison of the most positively affected group with the two least positively 
affected led to the discovery that the context in which the team building took place 
was very important: “... the quality of integrated prework and postwork processes 
surrounding the laboratory is a far more potent determiner of developmental impact 
than are variations in trainer role and behavior or differences in climate and content 
of laboratory training session” (77, p. 395). Another analysis of the same data (79) 
revealed that the impact of team-building activities was also influenced by the level 
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of intragroup trust at the outset. Initial intragroup trust was a better predictor of 
subsequent perceptions of group effectiveness and the worth of group meetings than 
the initial ratings on the same dimensions, But intragroup trust itself was not 
increased by the training except when pre- and postwork with the consultant oc- 
curred. Harvey & Boettger (96) found that a brief “experiment” involving member 
confrontation of the group leader led to behavior change. They also found that the 
four participants who confronted the leader during the “experiment” were signifi- 
cantly more positive about him a year later than the eight participants who did not. 
Since those who confronted the leader selected themselves, we cannot be sure that 
the confrontation alone accounted for the differing perceptions, but the finding is 
consistent with the theory that confrontation is a critical element of successful team 
building, even if it cannot be considered unambiguous. 

There remains a dearth of evidence for the effects of team building external to the 
group developed. Harrison (93) noted that the changed interpersonal perceptions 
of Argyris’ T-group participants did not generalize to the participants’ subordinates. 
Bigelow (25), on the other hand, found that participation in group development 
activities affected teachers in the classroom, even though the group development 
activities were not designed to change classroom behavior. Analysis of tapes of 
teacher classroom behavior indicated that partictpants became more integrative and 
less dominative than controls, and that student perception of relationships with their 
peers improved in the classrooms with trained teachers. Fosmire, Keutzer & Diller 
(74) report the results of group development with the entire faculty of a new high 
school. Their intervention led to increased interpersonal openness and acceptance 
of conflict among the faculty and to increased perceptions by students of the faculty 
as responsive and themselves as responsible. 

Though none of the research designs is flawless, there is convergent evidence that 
group development activities affect participant attitudes and sometimes their behav- 
ior as well. These effects may also “spill over” in some fashion to other organization 
members. It remains unclear, however, what mechanisms operate in successful team 
development activities, or what critical conditions must be satisfied for successful 
generalization of learnings outside the team, or what effects group development has 
on actual task performance. 


Intergroup Relations Development 


Problems at the interfaces between groups in an organization are endemic in modern 
organizations, and they are a matter of central concern to OD practititioners (23, 
75). Lawrence & Lorsch (113) suggest that intergroup relations problems are an 
inevitable correlate of the differentiation of organizational subsystems to meet and 
deal with environmental complexity. Important conflict has been reported between 
different departments in business organizations (68, 151), between headquarters and 
field teams (30), between field and administrative officers of the State Department 
(23), between a management development program and other subsystems in a bank 
(4), between union and management representatives (29, 30), among different occu- 
pational specialties in coal mines (164), and between potential merger partners (34). 

There is no shortage of ideas for managing intergroup conflict. Some theories have 
been developed in social psychological research (154), others from work with orga- 
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nizations (141). Neilsen (133) describes seven strategies for intervening in intergroup 
conflict that vary in the degree to which they deal with behavior alone or with both 
behavior and attitudes. Walton (167) has suggested four interventions for third 
parties to help manage intergroup conflict: 1. reduce the potential for conflict (e.g. 
change structure or personnel); 2. resolve substantive issues (e.g. make the deci- 
sions); 3. help manage manifest conflict (e.g. be a referee); 4. facilitate a change in 
the relationships. 

The first two alternatives require power over the combatants, an option that some 
technostructural consultants have used to great effect. Trist et al (164), for example, 
eliminated counterproductive intergroup conflict among coal miners by changing 
the sociotechnical system. But most human process consultants have no such power 
or they decline to use it. They have preferred the latter two interventions: helping 
manage manifest conflict and facilitating relationship changes. Walton (168) offers 
some provocative insights into the complexities of the third party role; his approach 
involves improving intergroup relations by improving relations between their repre- 
sentatives. 

A more common approach has been to foster a general “problem-solving” ap- 
proach to the relations between groups. Blake, Shepard & Mouton (30) argue for 
problem solving in the context of a nine-cell typology of intergroup conflicts based 
on the degree of inevitability of the conflict, the size of the stakes, and the possibility 
of agreement. Schmidt & Tannenbaum (146) suggest four different approaches, 
culminating with problem solving. “Problem solving” is the preferred outcome for ` 
both if the groups are interdependent. 

Intergroup development interventions to change relationships and manage con- 
flict in practice are based on information sharing, confrontation of differences, and 
working through to new understandings. The prototype design requires groups to 
develop lists of their perceptions of themselves, the other group, and their views of 
the other group’s perceptions of them, and then to use the lists as input to developing 
better understanding of each other and to creating future action plans. This design 
was developed by Blake and his colleagues (29, 30), but it has been used and 
recommended by many others (e.g. Beckhard 20; French & Bell 75). 

There is very little systematic research on the effectiveness of such interventions 
in the field. Case studies abound (e.g. Blake, Mouton & Sloma 29), but they leave 
many questions about the efficacy of the intervention unresolved. Golembiewski & 
Blumberg (87, 88) report attitudinal changes after a workshop that included con- 
frontation among several groups from the same organization, but while participant 
attitudes changed in the ways predicted by the investigators, no data were collected 
on the consequences of the intervention for individual behavior or organizational 
performance. Further, effects of the intergroup intervention may have been con- 
founded by a simultaneous confrontation between the participants and the consult- 
ants (who, after all, constituted an extra-organizational group that was potentially 
a unifying “external enemy” for all participants). 

It is painfully evident that more research is needed before much can be said about 
the utility of the present process-oriented technology for managing intergroup rela- 
tions. There is a good deal of activity going on, but relatively little in the way of 
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unambiguous findings available. We simply do not know much about whether OD 
interventions lead to better management of intergroup relations or not. 

A related area that demands further investigation is the role of the consultant in 
the management of intergroup relations. The consultant’s impact as representative 
of a group—a group external to and possibly a threat to his client—is not often 
considered. Lewicki & Alderfer (116) have explored some of the dilemmas encoun- 
tered by researcher/consultants to intergroup conflicts, but further investigation of 
this aspect of OD research/consultation is needed. 


COMPARATIVE STUDIES OF OD INTERVENTIONS 


One approach to comparison of OD interventions has been to examine successful 
and unsuccessful interventions reported in the literature. Greiner (91) compared 18 
studies of organization change (technostructural, human-processual, multifaceted, 
and non-OD approaches) and found three features that distinguished successes: 
(a) successes consistently arose from strong internal and external pressures for 
change; (b) successes followed a consistent pattern from initial pressures to the 
gradual involvement of many levels including top management in diagnosis and 
change activities; and (c) successes were characterized by shared decision making 
rather than unilateral or delegated decisions. 

Buchanan (39) derived three OD “crucial issues” that separated the successful 
from unsuccessful from the study of ten cases of human process interventions: (a) 
successes introduced a new model of operation to guide changes; (b) successes 
established linkages with others inside the target system; and (c) successes estab- 
lished linkages with others outside the target system. 

But Greiner’s (91) and Buchanan’s (39) studies do not evaluate alternative inter- 
ventions. Bowers (35) has compared empirically the impacts over time of four 
interventions—interpersonal process consultation, task process consultation, labo- 
ratory training (group development), and survey feedback—and two control condi- 
tions—data handback and no treatment—on a host of attitudinal variables. Survey 
feedback, interpersonal process consultation, and data handback led to positive 
changes on a majority of the dependent measures, while task process consultation 
led to no change, and laboratory training and no treatment Jed to negative changes. 
Further analysis indicated that changes in perceived organizational climate (human 
resources primacy, communication flow, motivational climate, decision-making 
practices, technological readiness, and lower-level influence) influenced the impacts 
of the interventions; without positive climate changes no interventions had very 
positive effects, and with them even laboratory training helped. The only interven- 
tion that directly improved organizational climate, however, was survey feedback. 

Bowers’ (35) findings about the impact of survey feedback may reflect the impor- 
tance of developing a shared conception of problems and appropriate action steps, 
an interpretation that is consistent with Buchanan’s (39) emphasis on a “new 
model” to organize change efforts, Greiner’s (91) “shared” decision making about 
diagnoses and solutions, and our earlier description of survey feedback as an apprro- 
priate “bridge” between diagnosis and subsequent interventions. 
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MULTIFACETED ORGANIZATION DEVELOPMENT 


There are a number of reports of organization development activities with organiza- 
tions as a whole. Some are anecdotal case studies (e.g. Beckhard 18; Davis 64; 
Alschuler 9), and some evaluations of interventions based on attempts to collect 
“harder” data, albeit with design problems (e.g. Blake et al 28; Huse & Beer 105; 
Zand, Steele & Zalkind 175). Relatively few reports, however, combine interven- 
tions into both technostructure and human processes with systematic efforts to 
evaluate the interventions’ effects or to account theoretically for the phenomena 
observed. 

Whyte & Hamilton (173) report work that predates the term “OD,” but that fits 
under the OD rubric. With only a hazy theory of organizational change, the authors 
began an avowedly exploratory project with the “Tremont Hotel” in 1945. They 
observed and interviewed people in five departments and top management, and 
developed solutions to problems through group meetings of those involved. The 
organizational changes that resulted included redefined roles, improved productiv- 
ity and safety records, decreased turnover, and improved interpersonal relations. 

Another pioneering study was reported by Jaques (106), who began work in the 
late 1940s with the Glacier Metal Company in England. He reports five cases of 
work with company subsystems that requested the help of the research team, whose 
task was: “... developing methods of offering technical assistance to groups that 
requested the help of the team in exploring underlying and concealed forces— 
whether psychological, cultural, structural, or technological—that were impeding 
their progress or otherwise reducing their efficiency” (106, p. 306). The team ob- 
served and interviewed members at work, and offered their impressions and inter- 
pretations at group meetings to help clarify underlying issues. Although Jaques 
claims no responsibility for organizational changes, it is clear from his descriptions 
that the interventions of the research team often influenced discussion and decisions 
taken. 

Seashore & Bowers (149) studied an explicit attempt to change three of an 
organization’s five departments through several interventions including problem- 
solving meetings, seminars, survey feedback, and personal coaching and counseling. 
Results supported the hypothesis that the interventions contributed to positive 
change on pre-established dimensions, and that the experimental groups changed 
more positively than the controls. 

In a subsequent study, the same authors collaborated with Marrow to study the 
changes in the Weldon Manufacturing Corporation after its acquisition by a com- 
petitor (126). The new owners launched a change program that included technical 
system changes, widespread training, more careful selection, revised payment sys- 
tems, and increased participation in decision making and problem solving at all 
levels. Results included positive attitude changes, increased return on capital and 
production efficiency, and decreased turnover and absenteeism relative both to past 
performance and to the company’s major competitor. Follow-up research 7 years 
later (150) indicated that many changes were durable over that period. 

Hill (100) studied a major sociotechnical change process in the Tavistock tradi- 
tion of OD at Shell UK Limited. Carefully orchestrated interventions to clarify 
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company objectives and values, share them with all levels, and then to share respon- 
sibility for increasing profitability and for meeting community needs led to positive 
changes in understanding, levels of individual skill, and organizational climate. Of 
23 departments, 15 reported organizational or procedural changes, 10 reported job 
design changes, 10 reported changes in responsibility and commitment, and 5 re- 
ported improved performance. There were also changes in plant design, staff ap- 
praisal systems, manpower planning, and job enrichment programs. 

Finally, Lawrence & Lorsch (112) have developed a “contingency theory” of 
organization that describes organizational structure and process contingent on the 
environmental uncertainty. They have explored the implications of contingency 
theory for OD and reported change efforts in a variety of organizations that seek 
to take into account the interfaces between individuals, groups, the organization, 
and its environment. Although only crude efforts at evaluation have been made, the 
theory is one of the most promising yet available in OD, in that it attempts to 
account systematically for both structural and processual variables and interven- 
tions at various levels in the organization. 

More recently, Clark & Krone (55) have developed a framework for open systems 
design and planning which encompasses the individual, his group process and 
structure, and the realities of his perceived organizational environment. 

Of the six cases, three—Whyte & Hamilton (173), Jaques (106), and Lawrence 
& Lorsch (112})—emphasize exploration of the processes of organizational change, 
and their primary contribution is to conceptual formulations of those processes. The 
others—Seashore & Bowers (149), Marrow, Bowers & Seashore (126), and Hill 
(100)--emphasize careful evaluation of explicit interventions into organizational 
structures and processes on behalf of planned change. The evidence suggests that 
such projects can lead to changes in organizational performance as well as changes 
in individual attitudes and behavior. 

Examination of the six reports also suggests some other observations. First, OD 
is not a short-term process; on the contrary, these cases took years to bear fruit. 
Second, it is important to have the support and active involvement of top manage- 
ment in the project; without that support it is difficult to gain commitment from 
lower levels. Third, in these projects interventions were made at several levels of the 
organizational hierarchy; support of top management by itself is not a sufficient 
condition for success. Finally, unambiguous evaluation of OD interventions is ex- 
ceedingly difficult to arrange even with professional researchers involved as in these 
cases. 


CONCLUSIONS 


We have described OD as a planned change effort where the intervention is at the 
individual and process (human-processual) levels or the technological and structural 
(technostructural) levels. Excluded from our concept of OD are change efforts 
which deal only with individuals and do not make parallel efforts to change the 
structures and processes in which these individuals live and work, change efforts 
which focus entirely upon the technological factors and do not consider the struc- 
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tures and processes needed to support these changed technologies, and change 
efforts which impose structural reorganization without considering human or tech- 
nological needs and demands. We have therefore reviewed studies of change which 
deal with the technostructural or human-processual aspects of the organization, and 
have found these to be reasonably effective beginnings for organizational change. 

In general, human-processual approaches (survey feedback, group, and inter- 
group methods) have a number of positive effects on the attitudes of those involved. 
There ts little evidence, however, that organizational processes actually change, or 
that performance or effectiveness is increased. Of the technostructural approaches, 
sociotechnical systems has the clearest effect on performance, while all three meth- 
ods (sociotechnical, job design and enlargement, job enrichment) tend to increase 
satisfaction with work. These findings, however, may be colored by the fact that 
almost all technostructural interventions have been done with blue-collar or lower 
level white-collar workers, whereas most human-processual interventions were done 
with white-collar management groups. In blue-collar jobs, where more concrete 
criteria of performance are available, increased effectiveness may be easily measured. 
On the other hand, higher level white-collar jobs deal increasingly with abstract 
performance criteria that make measurement difficult. Little is reported in the 
literature about using human-processual approaches with blue-collar workers, or 
about using technostructural approaches with management level employees. This 
may be due to the inapplicability of different methods to different levels, or it may 
reflect the difficulty in measuring abstract criteria as noted above. 

In one sense, sociotechnical systems, job design/enlargement, and job enrichment 
are theories about the appropriateness of certain social and technological structures 
rather than change methods. They have emerged from theories which attempt to 
understand and explain the currrent condition of personal, social, and technical 
systems. On the other hand, survey feedback, group, and intergroup methods have 
emerged from efforts to invent processes for development, influence, and movement. 
They may be methods for getting a change process moving, but it is never clear what 
their structural implications are. Increased integration of the two approaches, we 
think, will increase the present capacity of OD to influence organizational effective- 
ness toward both human fulfillment and task accomplishment. 

We believe that OD needs to explore various technologies, different sorts of 
change agent-client relationships, and alternative values currently emerging in our 
society. OD needs a framework to encompass this diversity that would encompass 
different values, different technologies, and different relationships contingent on 
different planned outcomes of intervention. 

Some of the potential richness and diversity of OD are represented in its applica- 
tion to wider planned social change activities. Goodstein & Boyer (90) intervened 
on behalf of the community in a municipal agency; Walton (169) tried to develop 
solutions to international border questions in an intergroup workshop with country 
representatives; Wedge (172) describes an effort to intercede between antagonists in 
international conflict; Levin & Stein (115) and Duhl & Steetle (67) report interven- 
tions or attempts to intervene in community conflicts. These efforts have been 
marginally successful at best, and they may represent only the publishable tip of an 
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iceberg of complete failures. OD today is a long way from being the general theory 
and technology of planned social system change we would like to see it become. 

Why so? If we are to develop a more general theory of planned change, what 1s 
necessary? OD has been developed within and for an extremely narrow range of 
organizations: by far the most common consumer of OD to date has been business 
and industry. A more general theory of planned change must come to grips with 
a number of issues that OD at present ignores. 

The future of OD rests in part on its values and the degree to which its practice, 
theory, and research are congruent with those values. Thus far, most OD 1s initiated 
by the organization—for the purpose of furthering such organizational goals as 
increased performance. We seldom question the merits of these goals as part of the 
OD process. Though most OD practitioners and researchers in some degree value 
both organizational task accomplishment and human fulfillment, there is an orga- 
nizational press in favor of the former. OD as a field runs the risk of encouraging 
and implementing subtle but persuasive forms of exploitation, curtailment of free- 
dom, control of personality, violation of dignity, intrusion of privacy—all in the 
name of science and of economic and technological efficiency. Within the hierar- 
chical fabric of everyday organizational power struggles, OD researcher/consult- 
ants typically represent the contro] needs of management. The needs of those lower 
in the organization for a higher quality of life, for an expanded range of occupational 
and life choices may seldom be known or acted upon by the consultant. The only 
choice for many such employees is to remain in or Jeave their organization. Ross 
(142) points out that many value discussions in the field at the moment skate over 
some fundamental questions about present societal arrangements: OD may well be 
another organizational palliative, engaged in “making some people happier at the 
job of making other people richer” (142, p. 583). We need to consider in more depth 
what values are central to planned change and what implications those values have 
for client selection (or creation), permissible strategies and tactics, appropriate use 
of fees and other compensation. OD as a field is faced with decisions about the 
balance it can and will strike between changing institutions to increase human 
development and changing people to promote institutional development. The two 
goals are rarely consistent with each other. 

Theoretically most of OD has adopted what Walton (166) has called the “attitude 
change strategy” for social change, which is based on shared information, trust 
among the parties, collaboration, and free choice. Less attention has been paid to 
the “power strategy” for social change, which is based on information management, 
distrust, conflict, and coercion (though many OD interventions borrow the power 
of the establishment to enact changes). More effective OD will require more elabora- 
tion of the theory of power from the points of view of both the powerful and the 
powerless (e.g. Gamson 85), and so will a broader theory of planned change. 

Finally, broader applications of a theory of planned change will require expanded 
intervention technologies. Such technologies might include techniques for training 
advocates (51), tactics for organizing a power base and a client system where none 
existed before (8), technologies for information management and control for social 
ends (161), or rational strategies for the management of violence (134). 
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We are saying that current OD theory and practice may be a small part of a rich, 
broad, far-reaching, relevant field of planned change. As the wider field is legiti- 
mized and developed, and as broader technologies and theories are developed, these 
will feed back into and enrich the field of OD. 

If the practice and theory of OD is to merge into a broader field of planned 
change, what role will research play in this transformation? We believe that research 
will either play a far more crucial role in the advancement of this field, or become 
an increasingly trrelevant appendage to it. Thus far it has utilized its techniques 
primarily for evaluation and validation, and its current techniques are well adapted 
to this. Thus far it has chosen to play a relatively uninvolved and distant role in the 
change-practice situation. Thus far it has focused on producing data for research 
needs rather than practice needs. As a result, we have theory from an external 
research prespective only. We have generally failed to produce a theory of change 
which emerges from the change process itself. We need a way of enriching our 
understanding and our action synergistically rather than at one or the others 
expense—to become a science in which knowledge-getting and knowledge-giving are 
an integrated process, and one that is valuable to all parties involved. We believe 
that a theory of planned change must be a theory of practice, which emerges from 
practice data and is of the practice situation, not merely about it. 
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OBJECTIVE DIAGNOSTIC 
TESTS AND MEASURES 


Lewis R. Goldberg" 


University of Oregon and Oregon Research Institute, Eugene, Oregon 


In a previous chapter in the Annual Review of Psychology covering a quite 
different topic, Tulving & Madigan succumbed to the torment of the Annual Review 
author and wished aloud that “... at least some writers, faced with the decision of 
whether to publish or perish, should have seriously considered the latter alternative” 
(149, p. 442). If this final solution to the problems faced by reviewers seems at all 
unappreciative, consider the following dilemma: (a) the present chapter, the first on 
a topic that will now be reviewed once every 3 years, must be held to approximately 
25 printed pages; (b) in 25 Annual Review pages, if one avoided all text whatsoever, 
one could include a bibliography of about 675 titles; (c) during the 3 year period 
1970-1972, the period chosen for this review, there appeared over 750 titles on one 
instrument, the MMPI, alone (12), and thus it would not be possible even to list 
merely a bibliography of MMPI reports; (d) each year there are over 3000 titles 
(including books, chapters in edited books, and journal articles) of some relevance 
to this topic (12), or close to 10,000 for the entire period. So, clearly, this review 
must be ruthlessly selective, hacking away most of this bloat with machete-like 
abandon. 

Towards this end, efforts have been made to avoid duplicating the material 
covered in chapters on assessment, human abilities, personality, attitudes and opin- 
ions, personnel selection, scaling, and test theory in previous Annual Review vol- 
umes, and in chapters on personality, genetics of behavior, social and cultural 


‘The author wishes to acknowledge the contributions of Endel Tulving and Stephen Madi- 
gan (149), who, while never having met the author nor having seen a draft of this chapter, 
have provided quite substantial guidance by previous example. Any errors in the manuscript, 
therefore, are clearly their fault. Kudos, on the other hand, might be sent to the author, who 
is always most appreciative. As he is for the financial support which permitted him to under- 
take this adventure, support provided by Grant MH-12972 from the National Institute of 
Mental Health, U.S. Public Health Service. Investigators who feel neglected or misunderstood 
are encouraged to seek relief from the Public Health Service. 
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influences on psychopathology, human information processing, and statistics in the 
present volume. However, any such review shares a problem with the zone defense 
in professional football: namely that excellent scientific reports, like talented tight 
ends, can often slip through the cracks between the zones. Readers who feel that 
their favorite instrument was given short shrift, or no shrift, are encouraged to write 
the author, who may be able to provide them with an extensive specialized bibliogra- 
phy. 

In the current Annual Review of Psychology master plan, this chapter is part of 
the genus Abnormal and Clinical Psychology, and the species Assessment of Psy- 
chopathology. The “objective” in the title is meant to contrast with “projective” 
(whatever that may convey), not necessarily with “subjective,” as one might have 
guessed. The chapter itself is arranged as a diagnostic instrument. Since the material 
becomes increasingly more technical as one proceeds through it, one can assess his 
degree of involvement with the field simply by noting the page on which he moves 
to potentially greener pastures. ` 


FOR THOSE WHO HAVE WANDERED INTO THIS CHAPTER 
FROM ACROSS THE BORDERS... 


If you read only one book in this field every 10 years, the choice for you has now 
been made easy: read Jerry Wiggins’ Personality and Prediction: Principles of 
Personality Assessment (155). A magisterial integration of the concepts and the 
research from a field whose complex sprawl had easily bested all previous attempts, 
this textbook is the most important single volume written about assessment since 
1957 and the original edition of Cronbach & Gleser’s Psychological Tests and 
Personnel Decisions. From the opening chapter on correlation and multiple regres- 
sion to the penetrating analyses of five milestone assessment projects at the end, 
Wiggins presents an even-handed account of the conflicts and convergences of this 
inchoate field. Along the way, the reader will encounter such topics as moderator 
variables and the prediction of predictability, contingency tables and actuarial pre- 
diction, clinical prediction and its automation, decision theory and personnel assign- 
ment, observational techniques (including reliability theory, trait attribution, and 
behavioral analysis), structured techniques, and the role of theoretical models in 
assessment science and practice. Buy it; you’ll like it. 

At a considerably more technical level, one other book published during this 
period stands out as a signal achievement. Cronbach & Gleser (23) have returned 
again, now with two collaborators, Nanda & Rajaratnam, to produce a second 
methodological landmark. Their multifaceted generalization of classical reliability 
theory, entitled The Dependability of Behavioral Measurements (24), is an evolu- 
tionary reconceptualization of the fundamental measurement base for objective 
diagnostic procedures. Warning: This is not a book for the mathematically fragile 
or the conceptually timid. Translation: I found it hard sledding. 

Not so with two excellent compendia from the period, McReynolds’ Advances 
in Psychological Assessment, Volume 2 (94) and Butcher’s Objective Personality 
Assessment (13). The former includes chapters on the assessment of places, of 
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children, of managerial talent, of aggression, of intrinsic motivation, of therapy 
outcomes, and of psychotic behavior, plus a mercilessly long mother by the present 
author surveying the history of personality scales and inventories (54). The latter 
book, which focuses primarily on the MMPI, includes two absolutely superb chap- 
ters, one by Paul Mech) (98) and the other by Warren Norman (107). Mech! has 
returned to writing about assessment again, albeit in some obscure places (96, 97, 
99, 100). His writing style has changed over the years since that bodacious flurry 
of extraordinarily influential papers published between 1945 and 1960, and his 
papers now read as exquisite rambles through a tortuously labyrinthine phenome- 
nology. As a consequence, one leaves Meehl’s writing today with a sense of awe, 
rather than, as before, with a new point of view. Norman (107) is something else 
again. If Wiggins wins the literary prize for best novel-length work, Norman is the 
clear winner in the short-story-length class. Word for word, his Psychometric 
Considerations for a Revision of the MMPI contains more measurement wisdom 
than anything else published during the decade. 

This was the period of the revised test handbook, with five old friends of the 
practitioner-on-the-street spawning new ones. These mighty tomes, all rich sources 
of material on the merits and demerits of their respective tests, include Matarazzo’s 
comprehensively revised WAIS handbook, Wechsler’s Measurement and Appraisal 
of Adult Intelligence (91); Campbell’s Handbook for the Strong Vocational Interest 
Blank (14); the first volume of Dahlstrom, Welsh & Dahlstrom’s two-volume revi- 
sion of their MMPI Handbook (26); Megargee’s California Psychological Inventory 
Handbook (101); and Cattell, Eber & Tatsuoka’s Handbook for the Sixteen Person- 
ality Factor Questionnaire (18). All five volumes are real handbooks in today’s sense 
of that term, which means that none of them is small enough to be held with one 
hand. Though, to be fair, they look puny when compared to the gargantuan prod- 
ucts of O. K. Buros (11, 12). His two latest works, which might logically be called 
handsbooks since neither is small enough to be held with both hands, include the 
monumental Seventh Mental Measurements Yearbook (covering the period 1964- 
1970 and 1157 tests) and Personality Tests and Reviews (which, among many other 
significant things, reprints all reviews of personality measures from the first six 
Yearbooks). Most psychologists know that if they need information about a particu- 
lar test, they should consult Buros’ latest Yearbook. Less widely known is the fact 
that virtually all books about assessment are listed there, each followed by lengthy 
excerpts from most of the reviews these books received in professional journals. For 
information about published tests and related books, there is simply no better 
source. 

The key word in that last sentence was “published,” obviously only the tip of the 
assessment ice. Tailor-made instruments are often used in psychological research, 
and reports of their psychometric properties are sometimes included in Journal 
articles. No one systematically reviews that stuff (other than Psychological Ab- 
stracts, which catalogs everything—and, thus, nothing). Now there is such a survey, 
at least for measures used with children: Johnson & Bommarito’s Tests and Mea- 
surements in Child Development (71) includes 309 instruments, well classified and 
well described. If you wandered in here from developmental psychology, you’ll want 
to order this one at the same time you order the Wiggins volume. 
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FOR THOSE WHOSE INTEREST IN THIS CHAPTER IS AT 
LEAST PARTIALLY PRACTICAL OR APPLIED ... 


If you are into the practice of psychodiagnosis these days, many authors will teach 
you how to do it right (meaning their way), while many more will urge you to jump 
ship and give up the enterprise altogether. By and large, the former write or edit 
books, while the latter write chapters and journal articles. Exceptions to this general 
law are Raymond B. Cattell and Hans J. Eysenck, who do both things—and often. 
Eysenck alone had 60 publications during this 3-year period, including 6 books 
(actually 8, since one edited work was in three volumes [43-45]). For Eysenck’s 
approach to diagnosis, see (40, 42). For Cattell’s position, see (16). For the views 
of others, try another chapter or two in the edited book by Mahrer (88)—particu- 
larly that of Lorr (82)—or pick up the short booklet by Dahlstrom (25). 

Of those critics who fight to make you switch, one of the best is Albee (1). Such 
pugnacious types, in turn, are opposed by some highly muscled hardhats with a 
heavy biological emphasis, among them Mech) (97, 99), Rimland (125), Rosenthal 
(128), and Gottesman & Shields (61). If there’s anyone around who hasn’t yet read 
Rimland’s audacious argument against psychogenesis, ran—do not walk—to the 
nearest library and start reading it. Rimland is doubtless wrong, as are we all, but 
after you have read his piece, you will never think quite the same way again. Which 
means that you may be ready to compromise. If so, read Begelman (5), Gough (62), 
and Draguns & Phillips (33). 

Pecking away at people who use traditional diagnostic syndromes has been a 
preoccupation among psychologists with a behavior-modification or social-learning 
orientation towards psychochange (120). Much of the time spent in this business has 
been devoted to exorcising such hobgoblins as traits, diagnoses, medical models, and 
similar demons. Apparently most of this dangerous work has now been accom- 
plished, and some authors can turn their attention to other pursuits. For recent 
perspectives on the role of assessment in behavior modification, see the reviews by 
Kanfer (72) and by Goldfried & Kent (58), as well as the important theoretical 
article by Mischel (104). And for an assessment book that behaviorists will find 
congenial and informative, see the volume edited by Costello, entitled Symptoms of 
Psychopathology (21). This handbook includes chapters on the assessment and 
treatment of such “symptoms” as depression, anxiety, phobias, obsessions and 
compulsions, tics, stuttering, thumbsucking, enuresis, hyperactivity, sleeplessness, 
alcoholism, drug addiction, academic underachievement, exhibitionism, homosexu- 
ality, asthma, headaches, and other horrors—only a smattering of the ailments of 
any graduate student. 

Indeed, students can often convince themselves that their brains are turning 
mushy. If this is your own hypothesis, it might be wise to consult one of the recent 
volumes on the assessment of brain damage, either Goodglass & Kaplan’s Assess- 
ment of Aphasia and Related Disorders (60), a test manual for the Boston Diagnos- 
tic Aphasia Test, or Russell, Neuringer & Goldstein’s Assessment of Brain Damage: 
A Neuropsychological Key Approach (130), essentially a test manual for the Ha/- 
stead-Reitan Impairment Index. Three related volumes are focused primarily on 
children: (a) a report of a superb empirical study, Deutsch & Schumer’s Brain- 
Damaged Children: A Modality-Oriented Exploration of Performance (29); (b) 
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another monograph reporting a study of cognitive perceptual motor dysfunction 
(Rubin et al 129); and finally (c) Wender’s new textbook, Minimal Brain Dysfunc- 
tion in Children (152). 

Disturbed children tend to maintain a low profile in the eyes of the community 
at large, at least until one of them begins to steal things or to assault others, when 
he becomes a part of that national epidemic called “crime in the streets.” Among 
the many publications which relate at least tangentially to the prediction of criminal 
behaviors are the theoretically exquisite volume by Schachter, Emotion, Obesity, 
and Crime (132); a revised edition of Eysenck’s Crime and Personality (41); a 
monograph on underachievement by Kipnis called Character Structure and Impul- 
siveness (74); and two new books by the high priests of this field, Sheldon and 
Eleanor Glueck (49, 50). One of their volumes, Toward a Typology of Juvenile 
Offenders (49), reports the results of an empirical study in which three types of 
delinquents were discovered. The other book, an edited volume on the identification 
of predelinquents (50), should be read by everyone in assessment—at least by 
everyone who is willing to be exasperated, provoked, and stimulated at one and the 
same time. Some of the contributors to this short book discuss the empirical evi- 
dence on the validity of the Glueck five-factor table (based on interviewer ratings 
of (a) kind of paternal discipline, (b) type of maternal supervision (c) degree of 
paternal and (d) maternal affection, and (e) cohesiveness of the family) in predicting 
—or, unfortunately, most often postdicting—delinquency. The great mass of “‘evi- 
dence” from retrospective studies indicating that the system correctly identifies (I 
kid you not!) 91%, 81%, 91%, 89%, 90%, 90%, 91%, 92%, 100%, 100%, 82%, 
81%, 86% of various delinquent samples in the U.S. and 92%, 92%, 85%, 88%, 
90%, 80%, 91%, 90%, 98%, and 92% of various delinquent samples in Japan, 
Puerto Rico, England, France, and West Germany is clearly not scientific evidence 
at all, since these studies included data from only two cells in the overall fourfold 
table, always omitting the proportion of nondelinguents who would have been 
correctly and incorrectly diagnosed. On the other hand, that rising righteous indig- 
nation of yours may be wrongeous, if one is to believe the results of a few genuinely 
predictive studies reported in later chapters of this book; the apparent validity of 
the Gluecks’ prediction scheme is uncanny—in fact, it seems better than reliability 
theory would suggest is possible. Clearly, there is something funny going on here, 
and it’s about time that somebody takes a highly critical peek at it all. 

If delinquency is a phenomenon which is quite predictable, at least by some 
standards, then suicide is near the other pole of this continuum. Two reviews of the 
scientific literature on suicide prediction were published during this period, one by 
Lester (78) and the other by Brown & Sheran (10). Neither is very encouraging. Nor 
are Bloch & Goodstein (7), who have reviewed the empirical evidence on the links 
between personality measures and various speech disorders, including stuttering. 
Their verdict: no links. 


Diagnosis Based on Naturalistic Observation or Life History Data 


A two-volume potpourri of conference papers, entitled Life History Research in 
Psychopathology (126, 127), provides brief reports of some recent longitudinal 
investigations. The first volume is focused largely on studies of schizophrenia, while 
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the second includes a wider sampling of problems (e.g. delinquency, alcoholism, 
narcotic addiction, and suicide). In an important related paper, not included in these 
compendia, Overall (109) discussed a 2000-patient exploration of the relationship 
between marital history and psychopathology (the more previous marriages, the less 
present pathology, regardless of present marital status). Moral: It’s not what you’re 
doing, but what you’ve done, that counts. This conclusion is hard to assimilate with 
those from a series of investigations (Myers et al 106, Rahe et al 121) which suggest 
that the number of life changes one experiences is positively related to the incidence 
of physical and emotional problems. (Ostensibly, having many marriages and di- 
vorces means having a lot of life changes.) However, the life-change findings have 
come mainly from military samples, where the incidence of extreme psychopa- 
thology must be rather low, while Overall’s conclusion stems exclusively from a 
study of psychiatric patients, where the incidence of nondeviancy is definitionally 
near zero. Investigators looking for something fun to do might try combining the 
methodology of both approaches, using a heterogeneous sample of subjects. 

Back in the old days (meaning right now at most traditional psychiatric settings), 
diagnosis was a single-stage, global, completely intuitive process: patients were 
interviewed by a clinician, who then assigned the patient to a diagnostic class or 
type. Nowadays (meaning at some research-oriented settings), these traditional 
diagnoses are supplemented by others which stem from a multistage process that 
combines more molecular intuitive Judgments with empirical classification proce- 
dures. Specifically, the clinician, at the end of his diagnostic interview (142), now 
may complete one or more standardized rating scales, recording forms, or behav- 
ioral checklists. The clinician’s responses to these precoded forms are then com- 
bined, by clerk or computer, to provide a set of syndrome scores. The profile of 
scores, in turn, is then used, again by clerk or computer, to assign the patient to a 
particular diagnostic cluster or type. The classification system (i.e. the entire set of 
types) may originally have been developed either intuitively or empirically via 
previous cluster analyses of large samples of score profiles. An example of the former 
is Spitzer & Endicott’s DIAGNO system (140), while examples of the latter include 
the taxonomies devised by Lorr (82, 83, 87) and by Overall (110, 112). 

These attempts to standardize, to fractionate, and thus to objectify the process 
of clinical decision making have been among the most significant achievements in 
psychodiagnosis during the past decade. Yet, since most reports of these develop- 
ments have been published in psychiatric journals, this work may not be as familiar 
to psychologists as it should be. For this reason, the present chapter will provide 
more detail about these procedures than about most others, for which extensive 
reviews are already available (12). 

The standardized report forms, which provide the initial data for all of the 
semiautomatic diagnostic systems, differ from each other in length, in focus, in the 
degree of structure they impose on the clinical interview from which the clinician’s 
observations and judgments are derived, and—of most importance—in the level of 
inference required of the clinician (159). At one extreme of this continuum are the 
most abstract types of depth interpretations found in some psychoanalytic inter- 
views; at the other extreme are the behavior frequency counts favored by investiga- 
tors with a social-learning viewpoint. 
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The widely used Brief Psychiatric Rating Scale (BPRS), developed by Overall and 
Gorham, lies on the abstract side of this continuum (115). A 7-category scale (not 
present, very mild, mild, moderate, moderately severe, severe, extremely severe) is 
used for rating 18 symptoms.? In addition, the BPRS can be scored for four higher- 
order factor scores (thinking disturbance, anxious depression, hostile-suspicious- 
ness, and withdrawal-retardation), each the sum of three separate ratings; for two 
syndrome contrasts (schizophrenic/depressive and coping/resignation); and for a 
total pathology composite (the sum of all 18 ratings). Recently, Overall has devel- 
oped the Factor Construct Rating Scale (FCRS), which uses the same type of 
7-category scales for rating 17 symptoms.” Cluster analyses of FCRS profiles have 
suggested 5 major patient types, which Overall has labeled depression, thinking 
disorder, extrapunitiveness, neuroticism, and agitation-excitement (110, 112). 

Lorr & Klett’s popular Inpatient Multidimensional Psychiatric Scale (IMPS) 
demands somewhat less abstract judgments than those required for the BPRS or the 
FCRS. Its 75 items include 45 symptom intensity ratings (each on a 9-category 
scale), 13 symptom frequency ratings (each on a 5-category scale), and 17 dichoto- 
mous items. Newly hospitalized psychotic patients were interviewed and rated 
independently by two raters using the IMPS; factor analyses of such data from a 
number of separate samples provide the data base for 10 IMPS syndrome scores.* 
Cluster analyses of IMPS profiles in various samples have led Lorr to conclude that 
about 60% of psychotics can be categorized into 11 types, 7 of which are the same 
for men and women (excited, excited-hostile, hostile paranoid, hallucinated para- 
noid, grandiose paranoid, and anxious-depressed), 2 are unique to men (disoriented 
and anxious-disorganized) and 2 to women (excited-disorganized and retarded- 
disorganized) (82, 83, 87). 

In contrast to the rather global taxonomic efforts of Overall and of Lorr, a team 
of investigators at the New York State Psychiatric Institute has been attempting to 
map the activities and behaviors of psychiatric patients in a far more articulated 
manner. Led by Spitzer & Endicott, this interdisciplinary group has produced three 
major rating schedules and a half dozen recording forms during the period of this 
review. The Psychiatric Status Schedule (PSS) was designed to provide multifaceted 
coverage of psychiatric signs and symptoms (including the use of drugs and alcohol, 
plus antisocial and illegal activities), as well as to evaluate impairment in diverse life 


*Conceptual disorganization, hallucinatory behavior, unusual thought content; anxiety, 
guilt feelings, depressive mood; hostility, suspiciousness, uncooperativeness; emotional with- 
drawal, motor retardation, blunted affect; somatic concern, tension, mannerisms and postur- 
Ing, grandiosity, excitement, and disorientation. 

"Mental disorganization, mental distortion, affective impoverishment, bizarre motor mani- 
festations, psychomotor retardation, agitation-excitement, extrapunitiveness, intropunitive- 
ness, depressive mood, elevated mood, passive dependence, dominant control, denial, 
intellectualization, over-responsiveness, neuroticism (obsessive, compulsive, or phobic symp- 
toms), and anxiety-tension. 

‘Excitement, hostile belligerence, paranoid projection, grandiose expansiveness, perceptual 
distortions, anxious intropunitiveness, retardation and apathy, disorientation, motor distur- 
bances, and conceptual disorganization. 
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roles (143). The 21-page PSS booklet includes both a standardized interview guide 
and an interrelated inventory of 321 precoded items, most of which are brief non- 
technical descriptions of small samples of verbal behavior (e.g. Query: “What kinds 
of things do you worry about?” Item 15: “Mentions he worries a lot or that he can’t 
stop worrying”). Factor analyses of item responses from an earlier version led to 
the development of 17 PSS symptom scales,? a denial of illness scale, plus five role 
impairment scales (for the roles of wage earner, housekeeper, student or trainee, 
mate, and parent). In addition, as with the BPRS, the symptom scales can be 
summarized by four higher-order composite scores (subjective distress, behavioral 
disturbance, impulse control disturbance, and reality testing disturbance), as well 
as an index of overall role impairment. 

A close relative of the PSS is the new Psychiatric Evaluation Form (PEF), an 
attempt to cover much the same ground using far more global ratings (38). The PEF 
is housed in a 9-page booklet, which contains 27 rating scales and 2 checklists. 
Nineteen symptom scales are scored,® plus the same 5 types of role impairment as 
in the PSS. Another new instrument in this series is the Current and Past Psychopa- 
thology Scales (CAPPS), which includes 28 items from the PEF, 13 others of some 
incremental diagnostic significance, and 130 life history items (37). CAPPS scoring 
keys, again based upon factor analyses of previous protocols, include 8 indices of 
current pathology,’plus 18 life history scales.® In addition, the 171 CAPPS re- 
sponses can be used to provide a computer-based psychiatric diagnosis, via the 
DIAGNO-II program (140). 

PSS, PEF, and CAPPS, which were designed primarily as research instruments, 
are multipage booklets with separate scoresheets which must be keypunched prior 
to computer processing. Such instruments are hard to sell to busy practitioners, 
whose devotion to paperwork may not be so fueled by scientific ambition. Nonethe- 
less, clinicians must be inveigled into cooperation if the diagnostic scientists are to 
achieve their dream: a large-scale systematic research project in which standardized 
observational data are collected on all patients admitted to every hospital in a state, 


SDepression-anxiety, daily routine or leisure time impairment, social isolation, suicide or 
self-mutilation, somatic concern; speech disorganization, inappropriate affect or appearance or 
behavior, agitation-excitement, interview belligerence or negativeness, disorientation or mem- 
ory impairment, retardation or lack of emotion; antisocial impulses or acts, drug abuse, 
reported overt anger; grandiosity, suspicion-persecution-hallucinations; and alcohol abuse. 

Depression, anxiety, suicide or self-mutilation, somatic concerns; speech disorganization, 
agitation-excitement, hallucinations, inappropriateness, disorientation or memory loss; bellig- 
erence-negativism, denial of illness, grandiosity, suspicion-persecution; retardation or lack of 
emotion, social isolation, impairment in daily routine or leisure time; narcotics-drugs, antiso- 
cial; and alcohol abuse. 

"Reality testing, depression-anxiety, impulse control, somatic concern, disorganization, 
obsessive-guilt-phobic, elation-grandiosity, and overall role impairment. 

®Depression-anxiety, impulse control, social and sexual relations, reality testing, depen- 
dency, somatic concern, obsessive-compulsive symptoms, anger-excitability, mania, sexual 
disturbance, memory and orientation, disorganization, organicity, neurotic childhood, phobia, 
retardation, hysteria, and intellectual performance. 
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or better yet, in a multistate region. As a compromise between the scientists’ need 
for data standardization and response completeness and the practitioners’ need for 
brevity and simplicity, a number of new recording forms have been developed. 
Among the most impressive of these is an integrated set of six psychiatric record 
forms developed by the Spitzer & Endicott team (141). These forms, which were 
developed for a collaborative project involving 7 states and the District of Columbia 
(the Multi-State Information System for Psychiatric Patients), were designed for 
recording psychiatric status and history within a computerized record-keeping sys- 
tem. All are 8 1/2 by 11 inch forms which can be automatically processed by an 
optical reader for transmission directly into a computer and/or for storage on 
computer puncheards, The Problem Appratsal Scales (PAS), a one-page summary 
of the patient’s presenting problems upon admission to the hospital, includes 38 
5-category rating scales covering physical, intellectual, and social symptoms. The 
Psychiatric Diagnosis Recording Form (PDRF) is a single-page checklist for 
recording one, two, or three diagnoses from the “official” list provided, plus the 
diagnostic stage (admission, provisional, reevaluation, or final). The Mental Status 
Examination Record (MSER) is a 4-page condensation of the PSS, which includes 
156 symptom rating scales and 101 checklist items. A hospital (PER) and a commu- 
nity (PER-C) form of the Periodic Evaluation Record permit a brief evaluation of 
patient functioning during the past week; each of these single-page forms includes 
28 scales and 45 checklist items from the MSER, thus permitting analyses of patient 
change. Finally, the Psychiatric Anamnestic Record (PAR), a 4-page form with 152 
rating scales and 266 checklist items, covers the information typically included in 
a complete psychiatric case history. 

Another enterprising effort to standardize and automate clinical records is being 
carried out by Sletten, Ulett, and their associates at the State of Missouri’s Division 
of Mental Health (34, 138, 151). This team has been developing a computer-based 
network called the Missouri Automated Standard System of Psychiatry, which now 
serves 10 hospitals and 9 clinics within the state. Virtually everything normally 
collected from and about each patient is fed into the computer’s hungry maw, 
mostly via single-page checklists which substitute for the transcribed records of 
yesteryear. The Missouri Automated Mental Status Examination (136) includes 111 
symptom-descriptive items which are scored on two dozen or so factor analytically 
based scales. These scales plus another couple of dozen demographic variables have 
been used in stepwise multiple-discriminant analyses to predict the final psychiatric 
diagnoses for 12 types of patients (139). Studies using large samples and a cross- 
validation design have suggested the utility of these linear functions for purposes of 
diagnosis, as well as for predicting length of stay in the hospital, likelihood of 
running away, and potential for committing suicide and for assaulting others 
(134). 

Other single-page Missouri checklists include: (a) the Missouri Inpatient Behav- 
jor Scale, 90 items relating to ward behavior; (6) the Community Adjustment 
Profile System, 100 items about social functioning obtained from the patient’s 
relatives or friends, which provides 10 scale scores; (c) a form to record demo- 
graphic data plus all movements between installations; (di a checklist for the 
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emergency room or admissions desk; (e) one to record the findings from a physical 
examination; (f) a form for the social worker to record information about social 
functioning; (g) an alcohol questionnaire; (4) a medicaid report; (7) an outpatient 
contact sheet; (/) a form for recording all medication data; (X) another for recording 
blood chemistry; (/) yet another for information about any nursing home care; and 
(m) various forms for business records. Even the Missouri psychometrician is in the 
act. While the Peabody Picture Vocabulary Test (a standard instrument in the 
system) is not yet administered by robots, patients sort the 550 MMPI items, each 
printed on a prepunched IBM card, into two piles (True vs False), which are then 
fed to the computer for scoring. With such a system it becomes possible to check 
a clinical hunch with a flick of the programmer. Want to know what sort of patients 
committed suicide in the state during the past few years? Just ask the computer to 
search its files, find them all, and list their combined characteristics. Voila (135)! 

The Institute of Living in Hartford, Connecticut, is another institution that has 
attempted to computerize evaluations. Their new instruments include: (2) a 650- 
item mental status examination which can be completed by the clinician via a 
television monitor integrated with a modified typewriter keyboard (Donnelly et al 
32); (b) a one-page machine-readable form for recording nursing notes, thus permit- 
ting an automated sequential analysis of patient change (Stroebel & Glueck 144); 
and (c) a single-page machine-readable interview schedule that provides delin- 
quency predictions in a more objective manner than has been possible in the past 
(Glueck 48). All of these efforts, however, are based on the use of prespecified coding 
categories (e.g. multiple-choice questions). Wouldn’t it be nifty, some have sug- 
gested, if the clinician or the patient could respond to the computer in their natural 
language, in narrative essay style? That day hasn’t quite arrived, but tt may be on 
the way. Eiduson & Ramsey-Klee (35) at the Reiss-Davis Child Study Center in Los 
Angeles have been designing a general information processor, called the Psychiatric 
Case History Event System (PsyCHES), which represents a step in this direction. 
And Benfari et al (6) at Harvard have been developing another such sophisticated 
system, called Computer Assigned Symptom Evaluation (CASE). 

If the reader hasn’t fallen asleep by this point, news of some additional L-data 
measures may serve to do the trick. A short symptom checklist for the diagnosis 
of schizophrenia, the New Haven Schizophrenic Index (NHSI), can be scored from 
hospital records; intuitively derived item weights are used to produce a total score, 
which purportedly is highly related to clinical diagnosis (3). Also working from 
patient records, Hautaluoma (67) has reported a methodologically sophisticated 
analysis of psychiatric syndromes and their relation to antecedent behaviors and 
treatment prognoses. In a similar vein, Martorano & Nathan (90) factor analyzed 
the 100-item Boston City Hospital Behavioral Checklist and related these factors 
to those from the IMPS and to traditional diagnoses. Similar types of analyses have 
been undertaken to compare the data from American versus English patients (Eve- 
ritt et al 39) and to identify and/or classify depressed patients (Paykel 117, Zung 
160). And when a 64-symptom instrument, the Symptom Distress Checklist, was 
factor analyzed, items in the resulting four factors were found to be highly related 
to the clusters obtained when 20 clinicians were instructed to group the 64 symp- 


OBJECTIVE DIAGNOSTIC TESTS AND MEASURES 353 


toms intuitively (Derogatis et al 28). Two overviews of this scene are now available, 
one by Ulett & Sletten (150) and the other by Stroebel & Glueck (145). 

Although the vast majority of these investigations have been focused exclusively 
on adult psychiatric patients, there have been a few studies of children. Among the 
most intriguing new child measures is the Personality Inventory for Children (PIC), 
which consists of 600 true-false items to be completed by the mothers of children 
between the ages of 6 and 16 (Wirt et al 158). PIC scales, which are still under 
development, include 11 constructed by the intuitive strategy,’ plus at least 8 
constructed by the external (empirical) approach "H Far more restricted in content 
is the Louisville Fear Survey for Children, an 81-item inventory that can be com- 
pleted by parents or teachers on the one hand, and by children or their peers on the 
other (Miller et al 103); a factor analysis of parents’ ratings of their children 
suggested three major kinds of fears: physical injury, natural events, and psychic 
stress. Teachers’ ratings of the problem behaviors of elementary school children 
have been factor analyzed by Miller (102) and by Dielman & Cattell (30, 31). 
Miller’s analyses have led to the development of his School Behavior Check List, 
which includes 6 subscales (low need for achievement, aggression, anxiety, academic 
disability, hostile isolation, and extraversion), plus a total disability score. Dielman 
& Cattell found 5 replicated oblique factors from a 62-item problem behavior 
checklist (hyperactivity, sluggishness, suspiciousness, withdrawal, and acting out). 
Modern kids must be turning soft; there were far more ways of getting into trouble 
back in the good old days. 


Diagnosis Based on Questionnaire or Self-Report Data 


Life history data normally must be encoded by human observers before being 
processed for assessment purposes, and we human observers have been known to 
have less than perfect fidelity for this transmission task. Various biases, context 
effects, and stereotypes plague us; our channel capacity is severely limited; and we 
get disrupted by boredom, fatigue, illness, as well as a host of situational and 
interpersonal distractions. In short, since humans are expensive middlemen, psycho- 
metricians often opt for self-report measures, which attempt to deal direct. How- 
ever, an individual’s social stimulus value is a mighty significant aspect of his 
personality, and individuals may not always see themselves exactly as others see 
them. So personality assessors pays their money and takes their choice: between 
L-data, smudged, warped, or otherwise distended by gliches in the transmission 
system; or Q-data, with its own full share of foibles and frailties. If we are wealthy 
enough, we can buy both products. And when we can figure out how best to meld 
them together, we will have achieved a scientific breakthrough of the highest magni- 
tude (57). 


“Withdrawal, excitement, reality distortion, aggression, somatic concern, anxiety, social 
skills, family relations, physical development, intellectual development, and asocial tendencies. 

‘°General maladjustment, delinquency, learning disability, sexual identification, cerebral 
dysfunction, intelligence, somatization, and defensiveness. 
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For the nonce, however, let us consider Q-data as a codified form of interpersonal 
communication used by one person to evaluate, assess, classify or diagnose another. 
The content domain of the communication, which is specified by the assessor, can 
include perceptions of observable past events (9), symptoms of emotional distur- 
bance (51, 92, 131), transitory moods (93), or any other beliefs, opinions, or attitudes 
of interest to the nosiest of snoops. For example, the Minnesota-Briggs History 
Record is an attempt to measure via questionnaire some of the variables typically 
obtained from L-data (9). Indeed, the inventory was initially developed for use by 
relatives of patients hospitalized for psychiatric problems, and only recently has it 
been converted to a self-report format. The new version, which contains 127 items, 
provides 7 scales (family disunity, conflict with parents, health awareness, introver- 
sion, school or job failure, social misfit, and breakdowns or addictions). While the 
Minnesota-Briggs was designed to measure enduring life history patterns, a new 
Protile of Mood States (POMS) was built to assess highly transient and fluctuating 
affective states (93). The POMS consists of 65 items, mostly state-descriptive adjec- 
tives (e.g. “shaky”), each to be rated on a 5-category scale (not at all, a little, 
moderately, quite a bit, extremely). Factor analyses of a larger set of items led to 
the development of 6 highly homogeneous scales, measuring the moods of tension, 
depression, anger, vigor, fatigue, and confusion. For a fascinating methodological 
footnote on the measurement of moods, see the report by Meddis (95). 

Psychodiagnostic intervews, for all their potential utility (142), have one clear 
liability, namely their extraordinary cost in professional time. The desire for an 
inexpensive substitute for a diagnostic examination led to the development of the 
very first personality scales (54) and subsequently the need seems to have continued 
unabated. Maultsby & Slack (92) have developed a computer-based system for 
obtaining case histories directly from the patient, the output being a report to the 
clinician. However, computers themselves are not all that cheap, and they are 
awkward to truck around on house calls. Clearly, a self-administered questionnaire 
would be the cheapest way out. D. P. Goldberg (a distant relative of the author, on 
the O’Hallahan side of the family) has recently attempted to construct another one, 
this to identify individuals with nonpsychotic psychiatric disorders and to assess the 
severity of their problems. The research monograph that reports his efforts (51) 
includes the response proportions from each of three types of subjects (psychiatric 
hospital inpatients, clinic outpatients, and a normal community sample) to each of 
140 symptom-descriptive items, the overwhelming majority of which differentiated 
significantly among the three samples. The results of a factor analysis of the 93 most 
differentiating items are also provided, along with data on the characteristics of a 
revised 59-item scale. Schachter (131) has been trying much the same trick, with 
the hope of identifying the schizophrenics among those pregnant women who regis- 
ter for prenatal care. And Selzer (133) has developed a 25-1tem Michigan Alcohol- 
ism Screening Test (MAST), which appears to work wonders at detecting those who 
admit drinking a good deal (105). 

Two multiscale psychopathology inventories have appeared of late, one of which 
has been published in the scientific literature and is thus free to the user, while the 
other has been published commercially and is indeed quite expensive. However, 
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unlike fast automobiles and fine wine, in this case the cheaper is also the better. 
Specifically, Lanyon (77) has given away his Psychological Screening Inventory 
(PSI), a carefully constructed 130-item instrument that provides scores on 5 scales: 
(a) alienation (similarity to psychiatric patients); (6) social nonconformity (simi- 
larity to incarcerated criminals); (c) discomfort or anxiety; (d ) expression (extraver- 
sion or undercontrol); and (e) test-taking defensiveness. In contrast, one writes to 
a post office box in New York’s Grand Central Station to find out about the new 
Experiential World Inventory (EWI), a 400-item wonder that is supposed to pro- 
vide scores on 8 scales: (a) sensory perception; (6) time perception; (c) body 
perception; (d ) self-perception; (e) perception of others; (f) ideation; (g) dysphoria; 
and (A) impulse regulation (36). For those who must see it to believe it, Table I lists 
some EWI items, along with a few concocted by the famous humorist, Art Buch- 
wald. Actually, Buchwald’s items are included simply for comic relief; every Annua/ 
Review chapter needs some comic relief. But the tabled EWI items are not all that 
unrepresentative of the entire set, a fact that makes the EWI manual a gold mine 
to item buffs. For in that manual are all 400 item endorsement proportions, sepa- 
rately computed within various samples of college students, neurotics, schizophre- 
nics, alcoholics, and prisoners. As an example, take that subtle item for vampire 
detection (312: “I would like to drink blood”). Roughly 1 or 2 percent of the 
individuals in normal samples endorse that item, while between 5 and 10 percent 
of those in schizophrenic samples do so. Happily, however, roughly the same 
endorsement proportions apply to EWI item 48 (“I cannot fully open my mouth”). 
Lest you get complacent, however, I must pass on the dire findings for item 102 (“I 
sometimes feel an urge to bite somebody”): 25% of male college students endorse 
that one, while only 1% of the coeds do so. Add one more problem for Women’s 
Lib. 


AND THEN THERE’S THE MMPI As alert readers may have remembered, there is 
always a lot going on in this arena, far too much to write about here. However, a 
few major developments probably should be mentioned, if only briefly. Among the 
most significant recent MMPI studies are three by Overall and his associates. Two 
of these were on alcohol abuse (116, 153), and the third was a match race between 
the MMPI and the 16 PF, comparing their ability at differentiating among 13 
different types of psychiatric diagnoses; the winner was the MMPI, by many lengths 
(157). In another excellent study, Lorr & Gilberstadt (84) joined forces to assess the 
relationships between Lorr’s IMPS rating types (L-data) and Gilberstadt’s MMPI 
profile types (Q-data); the verdict: very little overlap. Nonetheless, the development 
of new profile types continues (63), as does research on the characteristics of persons 
in each type (66, 80, 119). Hopefully these findings will be incorporated into the 
computer programs that are used to provide those automated interpretive reports, 
but don’t hold your breath. O’Dell (108) has shown that, at least for the products 
of one of these services, individuals are unable to distinguish between the accuracy 
of a flattering universal report and one prepared specifically from their MMPI 
responses. Three overviews of this business, by Fowler (47), by Kleinmuntz (75), 
and Manning (89) are now available, thus sparing me the pleasure. 
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Table 1 Would the real set of items please step forward? 


Set 14 Set 2° 
The sight of blood no longer excites Animals often try to fool me. 
Me. I feel like killing untidy people. 
It makes me furious to see an in- I sometimes taste sound. 


nocent man escape the chair. l , , 
. Someone is making copies of me. 
When I was a child, I was an 


imaginary playmate. I sometimes think other people’s 


thoughts. 
I am bored by thoughts of death. { 
Lam afraid somebody may cut off my 
I become homicidal when people nose. 


try to reason with me. : 
Old men are indecent. 


I don’t like it when somebody is 


rotten. I am someone else. 


Most of the time I go to sleep I would like to drink blood. 
without saying goodby. I feel that my ideas may turn into 


Frantic screams make me nervous. insects. 


4Written by Art Buchwald, the noted humorist. 
Included in the Experiential World Inventory. 


Many users have complained that the MMPI is too long. Since a good many 
items, especially those towards the end of the booklet, are not scored on the standard 
clinical scales, there have been numerous attempts to expunge them. Various expur- 
gated versions of the MMPI have been devised, the most popular being editions of 
399, 373, and 366 items. A recent trend is to whittle down this item pool even more, 
to the first 168 items, or to a selected 86 (the Midi-Mult), or even to 71 (the 
Mini-Mult) (65). Since the direction of this movement is evident, the future should 
see the MMPI reduced to bare bones, namely a few single-item scales. For detecting 
latent obesity, one needs only the lone item: “I can’t believe I ate the w-h-o-l-e 
thing.” The masculinity-femininity scale can be replaced by one true-false item: “I 
am a male” (54). And the neurotic scales can all coalesce into the single item: “I 
am a jerk.” 

Whatever one thinks of the present status of the MMPI, virtually no one will deny 
its extraordinary historical influence upon assessment research and upon psy- 
chodiagnostic practice. Eventually, however, old inventories, like old soldiers, must 
slowly fade away. Hopefully the next decade will bring the MMPI’s successor, 
perhaps the new Differential Personality Inventory (15, 147) if it can be lured out 
of hiding. In the interim, since the MMPI will continue to be used and used heavily, 
it should be employed as sagely as possible. At present it is used anachronistically: 
the original clinical scales (each developed empirically to differentiate a particular 
small sample of psychiatric patients from a larger nonpsychiatric sample) are now 
employed as the structural elements in fairly elaborate quasiconfigural profile 
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typologies, and an individual’s responses are interpreted in reference to the at- 
tributes that tend to characterize others with similar score profiles. However, as 
Norman (107) has argued so articulately: 


Whatever one thinks of the desirability of the original construction of the basic clinical 
scales (granting the purposes for which they were intended), it is abundantly clear that 
they are about as inappropriate and maladapted a set as one could imagine for their 
current uses in profile analysis and interpretation and typal class definition [p. 64]. 


Specifically, the original clinical scales have at least three major liabilities for the 
task for which they are now being employed: (a) each scale is highly heterogeneous 
in manifest content, and consequently it is next to impossible to attribute any 
unambiguous and/or content-coherent message to a particular scale score; (b) the 
set of scales is both substantively and structurally redundant, a problem that is 
severely exacerbated by items keyed on two or more scales; and (c) a sizable amount 
of potentially significant information from the initial item pool is not even available, 
either because it stems from the many items that are never scored or because sets 
of negatively correlated items are included in the same scale, thus effectively elimi- 
nating their effects. While these psychometric facts of life were not obvious in the 
late 1930s when the MMPI was being constructed, they are today. Consequently, 
until the MMPI item pool is replaced, it is time to move from the old clinical scales 
to a new set of content-coherent and structurally independent ones. 

Over the years there have been numerous attempts to factor various sets of MMPI 
items. In contrast to similar analyses of MMPI scale scores which typically extract 
around three factors, the various analyses at the item level have produced little 
agreement about the number of factors in the MMPI pool; the estimates have ranged 
from around three, when the late Robert Tryon tried his hand at this game (148), 
to some unspecified but far larger number in the rotoplots of Raymond B. Cattell 
(27). Barker, Fowler & Peterson (4) extracted nine factors, but then they only used 
373 of the MMPI’s 550 items. In any case, given a few dozen extra pages, I would 
argue the heresy that factor analysis of items is as risky an enterprise as 1s empirical 
item analysis using criterion groups. What is needed for the construction of new 
MMPI scales is a multistrategy approach that is maximally sensitive to item content. 

Wiggins (154) has used precisely such tactics in the development of his 13 new 
MMPI content scales. Always with a sharp eye on manifest content, while simulta- 
neously keeping his sights aligned by means of empirical homogeneity analyses, 
Wiggins has fashioned a set of MMPI scales that are far better suited for use in 
profile typologies than are the original clinical scales. Unfortunately, since the 
content scales haven’t been around as long, there are far fewer data available about 
them (118, 146, 156). As a result, the practitioner should be encouraged to employ 
both the clinical and the content scales, just as with the Strong Vocational Interest 
Blank he can now employ the statistically homogeneous and content-coherent Basic 
Interest Scales along with the classic occupational ones (8, 14). And, to past MMPI 
investigators, an urgent request: rescore those old answer sheets on Wiggins’ content 
scales, and run that vintage study once again. Two publications for the price of one! 
If anyone should object, tell them you obtained your authorization on these very 


pages. 
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Diagnosis Based on Test Data 


Inventories are often loosely referred to as personality tests, but they aren’t, and thus 
they shouldn’t be. One cannot be too sticky about this, however, since over the years 
the term “objective test” has been defined in extraordinarily diverse ways. As this 
chapter 1s not the place to resolve the arcane semantic issues involved, tests will be 
defined, quite dogmatically, by right of adverse possession if you will (or if you 
won't, by right of eminent domain): an objective test is one in which the examinee, 
without gross cheating, can alter his score in one direction at most; this definition 
is roughly equivalent to that proposed long ago by Cronbach for a “maximum 
performance” task. By this definition, most standardized measures of physiological 
activity and physical proficiency, as well as most aptitude and achievement examina- 
tions, qualify as objective tests. 

Within clinical psychology these instruments have rarely been used for differential 
diagnosis, with the notable exception of those used for the detection of brain dam- 
age. The latter measures really should be reviewed by someone whose knowledge 
of neurophysiology ts a light year or two beyond that of your reporter; consequently, 
the wary are hereby advised to skip immediately to the next paragraph. For you 
unwary, however, five excellent studies of the Halstead Neuropsychological Test 
Battery have recently appeared (59, 76, 79, 123, 124). Overall & Gorham (111) used 
the WAIS and a group-administered and computer-scored version of the Ho/tzman 
Inkblot Test to check the hypothesis that performance differences between brain- 
damaged patients and those with no such damage are similar to the differences that 
normally occur as a function of aging; for the two tests employed, that hypothesis 
proved dead wrong. In another report on brain damage in old age, Kendrick (73) 
discussed his Digit Copying and Synonym Learning tests. Finally, Lewinsohn and 
his students (81) have tested patients’ short-term memory via visual, auditory, and 
kinesthetic modalities, showing that the test performance of those with anterior 
damage is worse than those with posterior. Moral: Fall backwards. 

Welcome back, wary. As you probably know, the outstanding, and longstanding, 
advocate of the use of objective tests in clinical diagnosis is Raymond B. Cattell. 
Over the years, Cattell has been collating existing tests, devising new ones, and 
factor analyzing various subsets of his resulting collection. He has published two 
major volumes dealing with these efforts, as well as two versions of his Objective- 
Analytic Personality Test Battery. For so energetic a sales campaign, customer 
response has been sluggish. The problem has been to convince clinicians that objec- 
tive tests are more valid predictors of significant clinical behaviors than are other 
types of measures commonly available. Fabian & Comrey (46) made such a compar- 
ison and reached the opposite conclusion. Most of Cattell’s efforts, however, always 
go Into yet another factor analysis (17); this ploy, while unlikely to convert the 
heathen, is certain to stir up someone who enjoys the same bead game (Howarth 
69). Finally, however, Cattell has published a report on the relationships between 
objective tests and clinical diagnoses (19). It should have been a corker. Instead, it 
must go down as one of the most amorphous and prolix of a long line of publications 
not always noted for their crispness. The tragedy is that there probably is something 
important going on here, since Lorr and Hamlin (64, 85, 86) carried out reasonably 
similar types of analyses and found gold. 
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FOR THOSE ASSESSMENT SCIENTISTS WHO HAVE BEEN 
HANGING IN HERE ALL THIS LONG ... 


As usual, there is both good news and bad news. First let me acknowledge the bad 
news, which you already discovered for yourself before reading word one of this 
chapter: your best work (your entire production?) has been overlooked in the 
author’s pigheaded scramble to conform to page limitations. The good news: you 
get another chance 3 years from now. In the interim, what are the scientific problems 
on which we should all be working? The author has attempted to answer this 
question, for the broader discipline of which this is a part, in a paper published only 
a short while ago (57). In capsule summary, these problems include: (2) discovering 
the most important individual differences from the host observable; (5) measuring 
those attributes as precisely as possible; and (ec) finding optimal methods for employ- 
ing these measures in both basic and applied endeavors. No wishy-washy character, 
the author stands 1000% behind his previous choice. In the space remaining, let us 
look at some recent research on the last two of these three problems. 

Measurement normally begins with the presentation of a stimulus (e.g. an item 
or a question) to a subject, in order to elicit a precodified (e.g. multiple choice), or 
potentially codifiable, response. Such a response is rarely considered a measure in 
and of itself; a lone response may be both unreliable (in that future responses to the 
exact same stimulus would not all be identical) and unrepresentative of responses 
to closely related stimuli (generalization from single instances, like generalizations 
from single subjects, can be hazardous to your health). Consequently, the responses 
to a set of stimuli are typically accumulated and then averaged in some manner 
before reliable measurement is thought to have occurred. The basic problem in scale 
construction is selecting the particular set of stimuli, the accumulated responses to 
which are used to produce a score. There are three general approaches to this task; 
callously disregarding older terminology, I refer to these as the Intuitive, Internal, 
and External strategies of scale construction (56). Many test constructors have used 
a combination of two strategies—typically the Intuitive and the Internal (70) or 
more rarely the Intuitive and the External (62)—-and Meehl (98) now advocates a 
combination of all three. Jackson (70) has provocatively challenged the alleged 
superiority of the External over the Intuitive strategies, and Ashton & Goldberg (2) 
have risen to his bait. For two other such empirical comparisons, see Fabian & 
Comrey (46).and Goldberg (56). 

After measures have been constructed, they really ought to be used. For most 
purposes more than one measure will be used at the same time, so that the measures 
—like the items before them—typically must be combined in some fashion. Long 
ago two broad integration possibilities were distinguished: actuarial (or statistical) 
and subjective (or “clinical’’). When they are pitted against each other repeatedly, 
as in a cockfight, neither manages to kill off the other in a clean enough manner so 
that all of the betting money changes hands. Most psychologists have come to agree 
with Mech! (96), who long ago set up the gaming regulations. On the other hand, 
Holt (68) has always objected to those very rules, and thus he sporadically confesses 
his refusal to pay up. The major unresolved problems today (other than persuading 
Holt to do some research instead of exhorting others to do it for him) include, on 
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the actuarial side, a specification of the sort of prediction functions that are most 
appropriate for today’s measures, and on the subjective side, a specification of the 
optimal meld of empirical data with informed intuition. 

The first problem has been formulated as a comparison between linear and confi- 
gural prediction models (22, 52, 55, 98). As the arch-linearist on these battle lines 
at present, the author hereby asserts his belief that virtually all psychological rela- 
tionships are going to turn out to be nonlinear and configural in nature. When 
measures of individual differences become precise enough, then the various nonlin- 
ear prediction models will become useful. At the moment, however, our measures 
are still too gross to expect nonlinear models to outperform linear ones, given the 
clear statistical problems involved in the former’s selection and use. These problems 
are particularly poignant at the moment, since so much effort is presently going into 
the development of complex profile typologies and into the search for trait-by- 
treatment interactions; the latter include not only the traditional patient-by-drug 
interactions, but also some patient-by-therapist ones (20, 122). 

And finally there’s human intuition—much exalted, much maligned, often grossly 
misused, but typically all that’s around. Most sorts of clinical decisions are made 
subjectively if for no other reason than that at this primitive state of our science the 
appropriate actuarial data have not yet been analyzed. Ostensibly, practitioners who 
are called upon to make such decisions on a daily basis come to learn a bit about 
how to make them. Nonetheless, professionals are still only human and, as noted 
before, humans have their days: good and bad, up and down, topsy and turvy. A 
few years ago I pondered whether the practitioner’s inherent unreliability might not 
be separated from his—hopefully somewhat valid—judgmental policies (53). Since 
a simple linear regression model can often be constructed to capture critical aspects 
of many judgmental strategies, what would happen if one substituted one of the 
judgmental models for the expert himself? An empirical study of 29 clinical psychol- 
ogists attempting to differentiate the MMPI profiles of psychotic from neurotic 
patients suggested that models tended to function at least as well as the humans that 
were modeled (53). 

Happily, this finding has now been replicated in a number of subsequent studies, 
and various investigators seem to have arrived independently at much the same 
conclusion, albeit by somewhat different routes. For example, investigative teams 
in Texas (113, 114) and in Missouri (137) have begun to model the decisions of 
psychiatrists selecting the most appropriate drug for different types of patients. Since 
diagnosis has already been automated in some settings, this consequent automation 
of treatment decision making could serve to eliminate the clinician altogether. If you 
find yourself being tickled by this fantasy, just remember: you could be the very next 
to go... 
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INTERVENTION TECHNIQUES: 22] 
SMALL GROUPS 


Kurt W. Back‘ 
Duke University, Durham, North Carolina 


Research in the field of small group intervention is more than a simple application 
of social psychological techniques. Although the techniques of group intervention 
are analogous to those explored in social psychological experiments, the whole 
context of the intervention movement makes other considerations important. 


NOT A TECHNIQUE BUT A MOVEMENT 


Working with small groups, either ongoing or selected for the specific purpose of 
intervention, is possible in many different ways. Groups have been used for religious, 
ideological, and political indoctrination, and there has been concern with the func- 
tioning of family, work, and community groups for a long time. The current interest 
in intervention derives, however, from a particular constellation of developments in 
behavioral science without any societal goals. Nevertheless, we are looking here not 
at a development purely within psychology, but at a social movement with its sects, 
ideologies, and polemics. A knowledge of historical conditions is thus important in 
understanding the movement. We shall present, therefore, the historical background 
of current group intervention techniques, especially those connected with sensitivity 
training and encounter groups. After this orientation we can then discuss method- 
ological and theoretical issues. 

Two reasons are usually given for using small groups as a direct entry point to 
a change process: (2) specific features of group interaction can be utilized to produce 
the desired effect; and (6) a small group may be a strategic point of attack in an 
attempt either to solve large organizational problems or to effect change in individu- 
als. 

Small groups have been important tools in many contexts, especially in religious 
ritual and some of the dramatic performances which came out of it. In these cases, 
however, it is apparent that the group action was embedded in the whole system 
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of beliefs, ulterior aims, and rules which tended to justify it. Thus the religious 
system and ritual lent themselves little to direct concern with group intervention by 
itself, though some of the ritual revealed shrewd insight into group processes. The 
shift toward open interest in the technique for its own sake was adumbrated by 
Aldous Huxley in 1932 in his novel Brave New World (42). Here, describing a 
society which has rejected all ultimate values in favor of instrumental values related 
to immediate functioning, he realizes that group actions in the society remain 
important but have to be pursued for their own sake. Thus a bishop becomes a 
community songster and the people romp together in weekly group sessions. We 
shall return eventually to the important role of religion and peak experiences in 
small group techniques. 


HISTORY OF SMALL GROUP INTERVENTION 


Discussion 


Small group intervention proper may be said to have started with the concentration 
of effort on the group process itself rather than using it as an adjunct to another kind 
of procedure. Its earliest embodiments came from efforts in psychotherapy on the 
one side and personnel management on the other. Some additional developments in 
social psychology and education were also important. 

The earliest use of group process in psychotherapy can be credited to Jacob 
Moreno. Moreno combined psychotherapy and dramatic techniques in his creation 
of psychodrama (75). He felt that the two-person situation of the therapist and 
patient could not be used effectively to recreate all the interpersonal situations in 
which the patient was enmeshed and which may have led to his difficulties. He 
therefore created scenes in which the patient was able to act out important events 
in his life with a variety of other people representing significant figures in his 
environment (alters), being helped by people representing himself who could act out 
better or more strongly what he would like to do (auxiliary egos). Moreno external- 
ized the personal topography of mind by setting the stage in three circles, the highest 
and innermost of which represented the most private, intimate, and dominating 
thoughts, with more and more superficial parts ofthe personality represented by the 
outer circles. We find in Moreno’s early work the seeds of many of the later small 
group techniques. However, one of the great differences from later work is that the 
whole scene was staged for the benefit of only one patient. All the other members 
of the group were therapists or their helpers. Thus Moreno did not create group 
psychotherapy in today’s sense, but he coined some of the important concepts used 
in later times, among them the ideas of encounter and here-and-now. 

Moreno stayed quite apart from the mainstream of psychotherapeutic develop- 
ment. Orthodox psychoanalysis was unfriendly to any radical changes in therapy 
techniques, and Moreno’s own work occupied a niche somewhere between psycho- 
logical therapy and dramatic training. Although his work was transported from 
Vienna to Washington at Saint Elizabeths Hospital, it remained for a long time an 
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isolated phenomenon. One of the reasons might have been that he represented 
himself from time to time as a divinely inspired leader (74). 

Another impetus toward small group intervention came through psychotherapy 
from the Tavistock Institute in London, where psychiatric work was influenced by 
a theory developed by Melanie Klein, a psychoanalyst (48-50, 85). This theory, 
which sees in group and therapeutic processes a reenactment of age-old myths and 
is quite willing to spell out the connection between these myths and the therapy 
process, is more hospitable to group therapy than is orthodox Freudianism. Several 
of the psychiatrists at Tavistock Institute as far back as the 1930s used groups in 
psychotherapeutic endeavors, even taking over and familiarizing the term “here- 
and-now” (31, 97). 

Psychotherapeutic groups were also started in the United States, especially in 
institutional settings where the groups persisted through time, and where therapists, 
social workers, and psychiatrists could see how the group interaction could influence 
the therapy process and could be used to help it. Slavson (90, 91) at the Jewish Board 
of Guardians, Redl (78) in Detroit, and Bettelheim (18, 19) at the Orthogenic School 
in Chicago put patients in group situations for therapeutic purposes and in this way 
advanced some ideas of group psychotherapy. 

Another route for group methods to become influential was through the develop- 
ing interest in personnel management. Following the reaction against Taylor’s (93) 
theory of scientific management which concentrated mainly on man-machine rela- 
tionships, some industrial psychologists turned their interest toward the informal 
groups among the workers. The existence of these groups must have been noticed 
before, but their importance and possible use for management purposes can be 
traced primarily to one set of studies, the Westinghouse Studies by Mayo (72) and 
Roethlisberger & Dickson (79). These studies originated in a desire to investigate 
pure man-machine relationships, in this case the relationship between lighting and 
performance. The investigators found, to their surprise, that lighting was less impor- 
tant than purely human factors; they later directed their attention toward the 
actions of informal groups within the company. Their detailed observations of the 
informal joking relationships in a small work group are still used as the major 
illustration for theories of small group development in industry. 

At the same time a counseling program at Westinghouse was started which tried 
to improve the efficiency of the relationships within the plant by concentrating on 
tensions arising from personal problems of the workers as well as problems within 
the group organization. This program, probably the origin of group programs in 
industry, has since been discontinued; the discontinuance has given rise to contro- 
versy about the importance of the whole experiment and of small groups in industry 
(38, 53). At that time, however, different industries started similar programs of 
group orientation, among them the Harwood Manufacturing Corporation in Vir- 
ginia (14, 71) and Dewey and Almy in Cambridge (68). In both these cases the 
initiation of the programs was mainly due to the individual efforts of the personnel 
managers, Bavelas and French at Harwood and Douglas McGregor at Dewey and 
Almy. These men were later able, through their positions and financial help, to 
influence the development of the group intervention technique. 


370 BACK 


Another impetus for the group movement came through the New Deal programs 
of the 1930s. Some of these programs involved large-scale efforts to change systems 
using minimum personnel, and some even dealt with the founding of new communi- 
ties. For several of these projects, apparently independently, it was decided almost 
by necessity to use groups as targets for change and education. This was necessary 
because the needed amount of professional help was not available. Such was the case 
in the development of literacy programs, the creation of new communities, and the 
farm settlement programs. The last of these involved an application of Moreno’s 
techniques of sociometry and sociodrama on a large scale (67). The literacy pro- 
grams are important because they involved some of the future leaders of the sen- 
sitivity training movement. 

Development in the theory of sociology and social psychology also prepared the 
ground for the emergence of small group techniques. In sociology, the works of 
Simmel, Cooley, and Park were rediscovered and with them the importance of the 
small group (46). In social psychology the application of experimental methods to 
the study of groups made it possible to use the traditional laboratory methods of 
psychologists to investigate social problems. The work of Lewin, Sherif, Bales, 
Allport, Asch, and their students stimulated the interest of other social psycholo- 
gists in the study of small groups, and also provided examples of the use of observa- 
tion and manipulation of groups. In addition, these studies aided theoretical 
development by showing the influence of small groups on individual behavior. Such 
notions and concepts as Sherif’s (87) “frames of reference,” Hyman’s (43) “reference 
groups,” Lewin’s (59) “group standards,” and Asch’s (6) “conformity pressure” 
contributed to the formation of a conceptual framework for understanding and 
manipulating small groups. 

Events surrounding World War II and its aftermath furthered the development 
of all these trends and also helped to bring them together. Programs aimed at 
forming crews of soldiers, such as air crews, into functioning groups as rapidly as 
possible led to the refinement of group formation techniques and helped, for in- 
stance, the Tavistock Institute to develop many of its methods (31). This made 
psychiatrists more accepting of group procedures and led them to develop therapy 
groups which could be run by less highly trained professionals and where different 
patients could help each other. Wartime dislocation necessitated a high premium 
being placed on efficient personnel management and cultural integration of new 
workers. An additional effect of the war was that many social psychologists in 
academic life were thrown into contact with the everyday programs of nonacademic 
organizations, especially the armed forces. Training in an actually functioning oper- 
ation illustrated for them the importance of group methods to assess individual 
performance. Studies of combat effectiveness showed the importance of loyalty to 
small groups in the U.S. Army (92). 

Postwar problems also seemed amenable to small group techniques. These prob- 
lems included the readaptation of prisoners of war to home conditions (28, 29), the 
reduction of intergroup tensions (95), and the improvement of labor relations (36, 
45). Thus the immediate postwar years saw a great flowering of group intervention 
techniques and the foundation both of places where small group techniques were 
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studied in an academic way and also of institutes whose members applied techniques 
of direct group intervention. 


Start of the Movement 


At this time academic, theoretical, social psychology laboratory experimentation, 
and the development of group intervention techniques were strongly allied, encom- 
passing a field known as group dynamics. Within a few years, however, these two 
wings separated and consequently it is possible to discuss group intervention as a 
topic in itself, although many of those in the field have retained overlapping interest 
in the two branches. 

The development of the group intervention movement was precipitated by some 
events which occurred in the immediate postwar period when the first group training 
laboratories developed a vehicle of small group intervention which has given the 
field its center and strength. This is a technique which has at various times been 
called T-group, sensitivity training, and encounter group. The impetus came from 
the almost fortuitous discovery that group techniques become more impressive and 
important for the members if the content of the group discussion is turned upon 
itself. That is, efficient group techniques are those in which members discuss what 
happens in the group, and the main content of the group work is the relationships 
within the group. Thus the main function of the group technique becomes the 
meeting or encounter, and the interest is on the here-and-now, the two concepts 
foreshadowed by Moreno. The technique, which was originally discovered in a 
laboratory on intergroup relationships at New Britain, Connecticut, in 1946, was 
consciously expanded in the first set of laboratories organized by the National 
Training Laboratory at Bethel, Maine (26, 52, 66). 

The NTL program, which was initiated under the guidance and theoretical orien- 
tation of Kurt Lewin and carried on after his death by his associates and students, 
was also aided by some people who had worked on group development in its earliest 
stages, including Leland Bradford and Kenneth Benne. It was also aided by associa- 
tion with the Tavistock Institute in England. The history of the movement can be 
summarized by division into four periods: the Heroic Period from 1946 to 1950, the 
Dormant Period of the 1950s, the Explosion of the 1960s, and the Crisis which 
developed toward the end of the 1960s (8). 


The Heroic Period 


The first 5 years were almost completely concentrated at the Bethel laboratory, and 
different techniques were being used every year. There were questions on how to 
adapt this new method to different kinds of applications in the behavioral sciences; 
whether, for instance, to give it a somewhat psychiatric content and make it a sort 
of therapy for normals. The format, length, and content of the sessions, and the 
degree of emphasis on applicability were experimented with; different people with 
different backgrounds were attracted and invited. Other problems were faced: the 
question of group composition—whether they should be strangers or people work- 
ing together; the question of location of the laboratory—-whether or not it should 
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be away from the general habitat of the group members, a sort of cultural island; 
and other problems. Different kinds of research were tried and abandoned in at- 
tempts to evaluate the program, and the laboratory became, in consequence, a 
training ground for new types of research. 

Finally, by about 1950-1952, the program jelled. This coincided with a change 
of leadership in that the people who were originally leading the groups were kicked 
upstairs to be consultants and leaders of the NTL, and a new group took over as 
the actual group leaders. The format of the whole program stabilized and the 
T-group or training group, which became for a long time the mainstay of the 
movement, was created and developed. 

A parallel development occurred in England at the same time. The Tavistock 
Institute had been functioning as a psychiatric institution with interests in group 
therapy and group processes (31, 44). Some of their ideas were similar to group 
dynamics. Following experiences with the British armed forces, a group at Tavis- 
tock evolved large-scale intervention techniques for use in industry and rehabilita- 
tion centers. Tavistock initiated a special brand of group intervention techniques, 
still following the psychiatric model, which looked at the client organization as a 
patient. There was some contact and mutual influence between this group and the 
group at Bethel; in the late 1940s W. R. Bion wrote a series of articles on group 
intervention which influenced many practices on both sides of the Atlantic (later 
collected in Bion 20). 


The Dormant Period 


At this time the National Training Laboratory had settled down to a routine, though 
it was still novel for many. It trained people from different countries and different 
environments, and in this way increased its influence. In addition, many of the 
trainees and associates at Bethel went to other places and started innovative and 
different kinds of centers. On the West coast some centers were started which were 
more interested in individual training than in making people more efficient in 
groups. There the term “sensitivity training” was coined. One of the more remark- 
able events during this time was the dissemination of the technique to other coun- 
tries in the Caribbean and in Europe where the technique was taken up in several 
different contexts. Europeans, especially the French (54) and Scandinavians (39), 
adopted group intervention techniques as part of political reconstruction and have 
remained much more conscious of the general social implications of the movement 
than their American counterparts. In the meantime, the original center of the 
movement at Bethel tended to stagnate and needed a new impetus. 


Explosion 


In the 1960s several events occurred which led to a sudden expansion of the 
movement. The National Training Laboratory reorganized itself and, like its Eu- 
ropean counterparts, sought to influence the seats of power. They did this by 
organizing programs for leaders such as special presidents’ laboratories where indus- 
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trial leaders were invited for training. These laboratories were successful and helped 
to pull many industries, businesses, community organizers, and government agen- 
cies into the Bethel orbit. The whole institutional structure was reorganized by 
having training programs for people who later could become members or associates 
of the program and be given assignments to lead workshops in various parts of the 
country with the stamp of approval of the NTL itself. In the workshops at NTL, 
programs were adapted for different kinds of consumers, such as community or 
recreational leaders. 

A different influence came with the foundation of the Esalen Institute in Califor- 
nia (8, 41). Esalen tried to serve several purposes, including at first giving seminars 
on new ideas for changing behavior and controlling the mind. These ideas were often 
inspired by Oriental philosophy and religion. Perhaps the somewhat pragmatic 
influence of the earlier years, especially in the National Training Laboratory, had 
run its course and group intervention needed some new religious infusion. The 
Esalen Institute used group techniques but called them encounters, and was more 
directed toward individual development than toward group action. The encounter 
group became popular, Esalen was quickly publicized in many national magazines, 
and group encounters or T-groups became a symbol of the life-style of the upper 
middle class. The popularity of this new branch contrasted with the more conserva- 
tive Eastern approach. However, there was some contact between the two centers, 
and some of the new Esalen techniques were tried at Bethel and the labs which 
Bethel influenced. It was clear to some in the mid-1960s that the approach used in 
the mid-1940s would not have to be followed slavishly, just as the innovators of the 
1940s were not bound by the techniques of experimental social psychology of the 
1920s. 

By the end of the decade three distinctly different centers had developed. One was 
the old National Training Laboratory, which by that time was part of the National 
Education Association. The NTL retained its original emphasis on group efficiency 
and group progress, although it tried to use some of the human relations approaches 
from the West. It was very much concerned with establishing consulting relation- 
ships with government and business agencies, and also with keeping its respectabil- 
ity by avoiding extreme experimentation and keeping a moderate position. 

By contrast, Esalen and the human potential centers which followed it encour- 
aged experimentation of all sorts. They kept the encounter technique as the central 
attraction while incorporating new ideas from Indian, Japanese, and Chinese philos- 
ophy, and later from Persian Sufism, astrology, mysticism, and various massage 
techniques. This was the movement which caught the popular imagination. One of 
Esalen’s allies was the Western Behavioral Science Institute, headed by Car] Rogers, 
which took some of Rogers’ ideas on counseling and adapted them to group counsel- 
ing (80, 81). Ideologically the WBSI was probably close to Esalen, but it stayed more 
within the psychology framework from which its leaders came. 

The third center was the Tavistock Institute in London. Here intervention tech- 
niques, in the medical sense, were applied to factories, housing projects, and delin- 
quent groups, diagnosing the problem and trying to adopt remedies specific to the 
situation. In addition, the Institute held general group sessions, but the clientele 
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were restricted to professionals in human relations, and the content was directed 
toward general understanding of interpersonal processes. 


Crisis 

By the end of the 1960s, the bulk of the movement had overexpanded and felt 
financial and organizational strain. At the National Training Laboratory this was 
brought to a head by the fact that the long-time director, Leland Bradford, retired 
in 1970 and a successor had to be found. In addition, various financial problems had 
to be solved. One attempt at getting a new director failed because the candidate, 
Warren Bennis, tried to redirect the whole laboratory back to some of its original, 
socially imaginative programs and also combine it with a graduate university where 
these new skills could be taught. A new director was finally agreed on, and he made 
some fiscal changes, including division of the NTL into different organizations 
depending on the kind of client population (8). 

At the Western Behavioral Science Institute a direct split occurred with the more 
humanistic members leaving and founding the Center for the Study of the Person, 
while the more conventional group remained at the Institute. At first the WBSI not 
only lost many of its people but also all of its contracts, while the new center 
attracted some research and promotional work. However, at a later time, the Insti- 
tute recovered although it did less encounter work and more work, often very 
imaginative, in the education and poverty areas. 

Esalen and its centers changed their technique too, deemphasizing the approach 
of attracting masses of people for weekend and week-long encounters and trying 
instead to train a kind of dedicated follower by having longer sessions where people 
were inducted into communal living. In Esalen’s urban center in San Francisco, 
however, they found that encounter had stopped drawing people; the big drawing 
cards now were occult sessions on the one side and physical kinds of manipulation 
and training on the other. Thus in the personal growth centers, the percentage of 
the time devoted to encounter groups has probably decreased, and it may soon be 
that this field cannot be called small group intervention techniques any longer but 
instead a sort of personal therapy and influence. 


THEORY 


Reality of Groups 


With this heterogeneous background and current development, it is easy to see why 
there is no codified theory to which all group interveners subscribe. However, there 
is a Surprising amount of agreement on general principles. The first article of faith 
is the principle of the importance of the small group as a carrier of change, its 
influence on the individuals who are its members, and its influence on the organiza- 
tions to which it may belong. The prime area of agreement here is that the group 
exists as a meaningful and real unit. General theories of social psychology and group 
dynamics are neutral on the reality of groups. They claim that the group can be 
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considered as a unit, but also that the structure of and the interpersonal relationships 
within the group must be considered. In the theories of group intervention, the 
group is more of a real unit, while in the general field of social psychology a 
nominalistic point of view, considering the group as a collection of individuals, exists 
side by side with a consideration of the group as a whole, as more than the sum of 
its parts. 

The basic idea of group intervention came from aspects of Lewin’s work (56, 60). 
The group is looked on as a kind of spatial, structured unit similar to personality 
constructs in Lewin’s other work. Each of the places within the group is then filled 
by a role which is necessary for the survival of the group, even if at the moment 
there is no individual to fill it. Therefore, a group is a collection of positions and 
roles, and not of individuals. This point of view was developed originally by Morton 
Deutsch (30) and taken up by Benne & Sheats (16) in describing the first National 
Training Lab. The nature of the functional roles within the group is differently 
defined by different theories, but the idea has persisted in spite of periodic neglect. 

The unity of the group is expressed most clearly in the work of the Tavistock 
Institute, where the group is taken as an actual unit and treated almost like a patient 
who is hurting because certain of the group functions are not being fulfilled. The 
basic Tavistock technique is then to diagnose the hurt or the symptoms as they relate 
to the actual working group, and to retrain the people to fulfill the functions of the 
group better. 

The theory on which most of the Tavistock work has been based derives mainly 
from W. R. Bion (20). He saw the main task of a group as doing productive work, 
but that this is inhibited by “derived functions”—interpersonal relationships which 
consist of dependency, fight-flight, and pairing. Only if these three functions are 
worked through and are in balance can real, productive work be accomplished. Bion 
believed also that the same relationships hold true in individuals, and that the 
imbalance of derived functions leads to illness. Thus the Tavistock theory, originally 
the most clinically and psychoanalytically based approach of all the intervention 
theories, takes the group most seriously as a real unit. 


Change Technique 


The group dynamics theory, which prevails in the mainstream of the intervention 
movement at NTL, looks at the process differently. Lewin’s legacy is shown in his 
last two articles on quasi-stationary equilibrium (57) and on feedback (58). The first 
term was introduced to describe how attitudes or behaviors of individuals are 
anchored within group norms, conceptualized as a force field with a gradient depen- 
dent on the attitudes of the people in the person’s immediate environment, his 
immediate group membership (57). Pushing a person away from the place where 
he is and away from the mean of the group pushes him, in effect, against stronger 
and stronger forces, and hence the moment that the push stops, he will slide back 
to the original position. The only effective ways of changing a person are either to 
change the whole group or to unfreeze the person who wants to change by abolishing 
the forces which anchored him in the group and making him choose a new group 
as anchor. 
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This theory illustrates the two techniques which are used in application to small 
group intervention. Persons are being changed as part of the whole group during 
the workshop which effects the change. The second technique is used when the 
person faces the problem of coming back to the old environment. In this case, he 
is like the person who is suddenly removed from the force field which has changed 
him, so that he slides back to his old equilibrium. To counteract that, several devices 
are used. One consists of treating the whole back-home group in the workshop. This 
is the technique used by the Tavistock Institute and by industrial consultants such 
as Blake & Mouton (21). It is not, however, the technique of the more orthodox 
group laboratory trainers. A compromise ts to take several people from the same 
organization into a training group so that a small, independent subgroup can be 
formed in the new environment which may relieve the pressures of the larger 
back-home group. In general, however, great pessimism exists about the possibility 
of long-range change. One popular theory is that carryover is only possible under 
conditions where the trainee has power. Therefore, it is probably easier to change 
a person’s behavior in his family than in his work situation. 


Feedback 


The second concept introduced by Lewin is that of feedback (58). Viewing the 
output of a group as a topic of conversation or as input into the group action, the 
group may discuss for a long time a fight which has broken out, how many impor- 
tant incidents have occurred, or their feelings toward the leader. These are topics 
which have occurred “right now” in the group, and are also those in which people 
have been recently emotionally involved. Discussion of these issues brings out 
people’s defenses, emotions, justifications for what they did, and possible under- 
standing of why they did it. Thus this kind of reinforcing feedback might make for 
either a benign circle or a vicious circle, depending on one’s point of view. 


Utopian Theortes 


Thus intervention theory is based, in part, on theories of social psychology, personal- 
ity, and psychiatry. However, it would be wrong to suppose that this theory has 
remained completely within the framework of experimental social psychology and 
theoretical group dynamics. Many people involved with the movement, although 
well versed in traditional theories, would object that most of them are too flat, too 
mechanistic, and view the person too much as an object of social forces (4). The key 
points in this objection are a need to look at the individual as an emotional and active 
being and an aversion to the use of single variables and the experimental kinds of 
manipulation which characterize current social psychology. The model which such 
theorists as Schutz (83, 84), Argyris (1, 2), and Aronson (5) produce would use 
interventions and the affect they arouse as the most effective way to understand 
behavior. But they would also admit in general that a world outside the group does 
exist and, perhaps to their sorrow, accept the similarity between present circum- 
stances and some of the current textbook theories. They have to propose an alterna- 
tive model of human behavior and use this model for their consulting work. 
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All the theories on group intervention have a certain ambiguity in common. They 
are not so much theories of what is, of how groups function, as of what could be, 
of how groups should function. They are not theories of how people adjust to their 
world, especially to their social world, but of an alternative way of behaving. They 
are less an application of a psychological theory than a vision of a new world and 
a new social movement. We shall see later how this utopian orientation affects 
research in the field. 


RESEARCH 


Normally, the procedure in discussing an applied field like small group intervention 
would be to start from the small group theories, going through the experimental 
evidence which has been cited by the theories, and then deal with the problems of 
application and research directed toward that field. The historical context of the 
present discussion, however, makes this procedure unfeasible and unrealistic. Much 
of the theory of small group intervention has common origins with the sensitivity 
group movement. Their history has since diverged, in fact has diverged so much that 
many proponents from either side have doubted the relevance of the one for the 
other. Thus, there are some joint interests found only in the earlier times and only 
a few fragments of linkage between the two can be used in discussing the research 
evidence for small group intervention in more recent times. 


Early Laboratory Studies 


The early development of experimental social psychology dealt primarily with 
groups in the laboratory. But even the first study, a study of authoritarian, demo- 
cratic, and laissez-faire groups in the laboratory by Lewin, Lippitt & White (61), 
dealt with a problem which was based on small group intervention. It was a question 
of which way groups should be formed, which way groups would function more 
effectively, in terms both of outcome and of effects on their members. This study 
may be methodologically questionable; by today’s standards the independent vari- 
ables are somewhat complex and vaguely defined, and some of the experimental 
controls were difficult to execute in such a way as to render the results unambiguous. 
However, for a while this line of research was followed by Lippitt in the assessment 
and training programs of the Office of Strategic Services (76) and by Lewin and 
others in studies on group decisions (59). 

In all these studies there was an effort to make the group the aim of the study 
and also to show how the group actions would be more productive under some 
conditions of participation by the members, especially under conditions in which the 
members themselves could feel that they were part of the decision-making process. 
In the meantime the first sensitivity group programs in Connecticut and Bethel had 
started, and they were influential in initiating a line of basic research. One of the 
more ambitious offshoots of this research effort was the so-called Conference Re- 
search Project by Marquis, Guetzkow & Heyns (70). 

The common core of these studies showed a contrast between the traditional 
theories of organization and management and psychological theories. The idea of 
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a complete line and staff organization, where decisions arrive from the top power 
levels, may be passed in conference with relevant staff members, and then are passed 
on to the line for execution, seemed to be contraindicated by psychological factors. 
Although this method might lead to quick and efficient organization and be neces- 
Sary in emergency situations, it had a negative effect on the morale and performance 
of the participants. However, none of these studies was very conclusive about the 
gains and losses of either type of system or how they could be combined. It may 
be that this inconclusiveness was the root cause of the split into the work with group 
intervention on the one hand and social psychological research on the other. The 
results of working with different types of groups in a realistic setting were never clear 
enough to justify any definite conclusion related to social psychological theory. On 
the other hand, the confusion of factors inherent in such situations, such as the 
personality of either the leader or the members or the particular techniques used, 
led the experimental social psychologists to pursue more simplified situations which 
involved a retreat from dealing with groups. 


Network Studies 


In the first decade after the group dynamics movement started, the split was not very 
deep. Several research trends appeared which influenced the group intervention 
techniques and provided a theoretical underpinning for the movement. One was an 
attempt to distinguish more clearly between organization for task requiredness and 
concern for the morale of the participants. Bavelas (15) and Leavitt (55) developed 
a technique of restricting communication between members of a group through 
artificial channels and then studying the efficiency of group performance by the 
subjects. Thus a group could be put together where everybody could communicate 
with everybody else, or communication could be restricted by having one person 
control all channels. Between these two extremes were various combined and hierar- 
chical situations. This kind of research has continued to be a good way to test many 
hypotheses about group structure. 

In general, the studies found that the hierarchical] structure was most efficient in 
quickly finding a solution to simple problems, but it led to low morale in those 
people who were not involved in the decision process. The complete communication 
structure, which corresponded to the most democratic and participatory situation, 
led to high morale in the participants but frequently to low efficiency. However, the 
more complicated the problem was, the smaller was the advantage of the more 
hierarchical system. After these initial findings, this line of research went more and 
more into detatled questions about the relation between ability, task complexity, and 
organization, and finally into questions which were not relevant to group process 
(86). 


Group Observation 


A second research approach developed at that time concerned the direct observation 
of group process, especially of decision-making groups. This technique was evolved 
by Bales in cooperation with his colleagues and students, and consisted, at least at 
the start, mainly of two parts. One was the development of a system of observation 
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categories by which groups could be described in a standardized manner and differ- 
ent groups and techniques compared. The ideal product would have been a sensitive 
instrument for group measurement and evaluation which would be analogous to the 
Stanford-Binet for individual testing. This hope was not quite realized. However, 
although norms have never really been established, Bales’s final system of 12 catego- 
ries divided into task-related and emotional ones has been used in later group 
research (10). 

Bales’s second contribution was to use these categories to establish a model for 
the normal progress of groups and the regularity of certain phenomena within 
groups. Bales used as models certain decision-making and problem-solving groups 
including case studies, conferences in a clinic, and discussions of organizational 
problems. He found a regular course of progress in these kinds of situations in which 
emotional and task-related categories assumed differential importance. The emo- 
tional categories were dominant at the beginning and at the end of each problem 
discussed, while the task-related categories predominated during the discussion 
itself. He also found that, over the course of a group, two kinds of leaders tended 
to emerge—one, the task leader, assumed leadership during the task-related discus- 
sion, and the other, the play leader, assumed leadership during the emotional 
discussion. These two leadership roles were quite distinct, were never filled by the 
Same person, and were sometimes even antagonistic toward each other (13). 

One would have thought that the availability of this kind of method and the 
number of hypotheses which could be derived through its use would have resulted 
in both a theoretical and an applied interest in pursuing the technique. However, 
the early interest in the observation of groups which led to small group laboratories 
being built in the 1950s has petered out (12). It is strange to see on how little 
evidence these theories, which are still basic in social psychology, are really founded. 
The original few studies by Bales & Slater (13) and Bennis & Shepard (17) have been 
praised, doubted, or attacked, but little replication or further research has been 
done. Contentions have also been made that some of the regularities observed were 
more a function of the fact that the participants knew they were being observed and 
thus showed their normative behavior, how they should act and not how they 
normally would act in a group (96). There has been little systematic study of groups 
since Bales, and most of the laboratories built for this purpose are used today for 
individual or two-person experiments. Bales himself continued his work more in the 
direction of classifying personalities than of building a theory of group development 


(11). 
Conformity 


A third approach that could have been relevant was a study of group-induced 
conformity. Based on Asch’s (6) original studies of conformity pressures, a set of 
experiments by Festinger and his associates showed the conditions under which 
group pressure is related to conformity, the conditions under which deviation oc- 
curs, how deviates are dealt with in the groups, and in general how the power of 
groups is used to keep them together and to influence their members (7, 9, 34, 35, 
47, 82). Here again, the first studies used groups, usually whole groups, to create 
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the conditions of cohesion and group influence. What emerged was the obverse of 
the dangers of complete line management, namely the terror and power of the group 
itself and its ability to distort individual judgment. Although these studies are 
recognized by the group interventionists, their relation to actual group settings has 
not been investigated, and they represent today more a classical field in group 
dynamics than a subject of current research interest. 


The Split in Research 


The influence of these studies on directing research in the setting of group interven- 
tion itself has been limited for several reasons. One is the declining importance of 
group studies in experimental social psychology. It is clear that if several people in 
a group are allowed to behave as they want, each of them can behave in very many 
different ways, and therefore a tremendous amount of variables will be put into play. 
In order to tease out the effects of different conditions, of the different personalities 
of the people involved, the interaction patterns, the situation, and so on, more and 
more different conditions have to be controlled. That is, the participation of the 
other people in the group may be simulated instead of real, the possible interactions 
may become more and more limited, and the time span of development of the group 
may become too small to imitate the real progression of a 3-week laboratory. As 
soon as the unwieldiness of groups in these respects was recognized, most research- 
ers in social psychology turned their interest instead toward individual behavior. In 
1954 Borgatta (25) forecast, in a half-serious parody, the emergence of the no-person 
groups where everybody will just be simulated; today, research has almost caught 
up with his prediction. 

Thus the development of social psychological research led to findings of less and 
less relevance to the study of group intervention. In turn, this might have been the 
basis of the interventionists’ mistrust, which grew larger and larger, of scientific 
methods and traditional research. Here the history went in the opposite direction. 
Faced with the difficulties of having such a complex situation, the interveners 
accepted these difficulties and became less and less interested in testable hypotheses. 
For a while the human relations training labs were used as sources for research data. 
Some studies came out of the early Bethel laboratories dealing with, for instance, 
roles within groups, influence, and group leadership. Some group experiments were 
also done by people who had been participants at Bethel, such as the study by Coch 
& French (27) on changing work habits in industry. An early study by Lippitt (66) 
followed up group members after they left the centers and investigated their later 
effectiveness in their work. These kinds of studies were done at the time when social 
psychologists still worked with groups and there were many researchers present at 
the labs. Some of the studies done at Bethel were used for doctoral dissertations in 
social psychology (40, 62, 94). 

It became hard to justify studies done in a group setting. Should the group 
laboratories be mainly used as settings where social processes could be observed and 
studied, or should the technique of intervention be studied, evaluated, and improved 
through research? The first alternative reached its climax in classes in which social 
psychology was taught by means of sensitivity groups, especially one class in the 
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Harvard Social Relations Department. Out of this class three books (69, 73, 89) and 
a great number of articles have come forth, all of them describing the group process, 
the relationships between different interactions, and the progress of the class. The 
amount of data the class provided also made it possible to develop new measurement 
techniques. Some of the development of Bales’s own scales was a result of this course 
(11), and some intricate content analysis techniques were worked out using data 
from it (33). However, the interest in the classes came more from the emotional 
impact that they had on the members at the time, and found expression in intense 
narratives such as Slater’s (89) more than in quantitative studies. 

These studies form a convenient dividing line between the early interest in sen- 
sitivity groups as a convenient site for psychological experiments and the later 
interest in their emotional impact for its own sake. Many of these later studies are 
mainly descriptions of the process involving some use of data, but never arriving at 
or testing general hypotheses. In general, they adopt the stance of the advocate who 
is trying to show that something does happen and that some effect can occur. 


Evaluation Studies 


It would indeed be surprising if these intense experiences occurring over a period 
of several weeks did not have any effect at all on the participants, considering that 
the much weaker influences which are exerted in social psychological laboratory 
experiments produce measurable effects. The question in all research is not whether 
anything occurs at all, but under what conditions it occurs and how it can be 
controlled. From the point of view of research it is much more important to show 
how the production of even a small effect occurs than it is to demonstrate changes 
in some people without knowing what caused them and what kind of people they 
are. Of course, in order to publicize the programs and to impress people that 
something is happening, the former approach is sufficient, but it can hardly be 
considered what is conventionally called research. 

The fate of evaluation research in this field shows the difficulties with which 
research on intervention is faced. As mentioned previously, there was an attempt 
in the first laboratories to evaluate the effects on the participants through follow-up 
and later reports. However, not enough conclusive results were obtained to make 
this kind of procedure attractive to most investigators. An analysis of 149 evaluation 
and process studies done in the 25 years since the beginning of sensitivity groups 
shows that very few used any behavioral measures taken after the laboratory ended, 
even fewer used reports from others than the participants themselves, and hardly 
any used control groups (8). The worst studies in these respects were done during 
the Dormant and Explosion Periods of the 1950s and early 1960s. During this time 
we find either reports that changes of any kind have occurred, or narrative descrip- 
tions of the process which give the reader a feeling of what is happening in the group, 
at least as far as the author-observer sees it. Similar studies are also reported 
involving applications of the technique to therapy and industrial situations and to 
situations aimed at the resolution of group conflicts (32, 51). Some of the most recent 
studies in the survey showed an improvement in methodology, which may be a 
hopeful sign. 
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The procedural inadequacies of most of the evaluation studies may represent a 
general reaction against the scientific method or at least against the methods which 
the experimental social psychologists use. The lack of findings forced the interveners 
to make a virtue of necessity. Many people in the sensitivity training movement 
reject the way most social psychological research, or psychological research in 
general, is being performed and proclaim that not only do they have a better way 
of changing and educating people but also a better way of doing research (3, 4). This 
is partly a counterattack against the flight of many social psychologists away from 
working with groups, and partly an attempt to show that the group process which 
is being advocated is also a process of research. Thus, articles on intervention 
techniques spend most of their time attacking traditional research methods and 
promoting new methods which supposedly correspond to the new kinds of groups 
which they are advocating. It might come to pass that only those social psycholo- 
gists who are indoctrinated with a belief in the higher value of the techniques which 
are promoted through group intervention will be recognized as being able to do 
research on these techniques. 


New Trends 


However, there are indications of a new trend in research. There is some interest 
among psychologists in examining some of the tenets of the encounter group move- 
ment. Although they are starting with some of the more primitive propositions, they 
eventually may find some way of going more deeply into the process itself. Some 
studies are now being done to explore the effects of nonverbal communication, 
touching, and closeness on group cohesion (24, 37). These are not necessarily studies 
of group intervention, but studies of some of the suppositions about how groups are 
formed. Another development has been at least one study, by Lieberman, Yalom 
& Miles (63), which is trying to evaluate encounter groups in a more overarching 
design by comparing several techniques, using two groups to represent each tech- 
nique, and also employing adequate controls. This study has yielded valuable infor- 
mation about the importance of the leader in the group, the problems of measuring 
danger to the participants, the high rate of casualties in certain kinds of groups, and 
the possibility of classifying the processes which occur. 

Finally, there have been some attempts to look at the groups from outside as social 
phenomena, to assess their function from a different perspective than that of the 
group interventionist. For instance, a hypothesis has been put forth that sensitivity 
groups, with their unstructured procedures and all-powerful leaders, tend to induce 
great anxiety, and that many of the techniques used in the training laboratories are 
ways of coping with this anxiety (22, 23). The current crisis period in the movement 
may lead to a decision within the movement itself to take a new course and may 
spur a new kind of research approach which will use different techniques than those 
previously envisaged by either the experimental social psychologist or the people in 
the movement itself. 

Some of the general concepts underlying group intervention techniques are based 
on theories in social psychology. However, the approach used in the development 
of the group intervention techniques is different from that of traditional social 
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psychology. Many of the earlier experiments, such as the Westinghouse and Har- 
wood studies, really laid the foundations of some of current social psychological 
theory. However, the later development of this theory is held by many of the people 
engaged in intervention techniques to be too one-sided. 


Utopian Research 


The ideas at the base of intervention techniques are not so much a theory corre- 
sponding to the world as it is as they are a theory of the world as it could be, itself 
transformed through these intervention techniques. This point of view becomes 
particularly apparent among the leaders in the group intervention movement who 
are themselves social psychologists and have in a way worked on both sides of the 
street. Thus Argyris (3) has practically defined two worlds, A and B. Society A is 
the world in which the general laws of current social psychology and experimental 
work hold but which is controlled by norms, goal directedness, instrumental activi- 
ties, and passivity of the observed object. On the other hand, society B is character- 
ized by openness, absence of norms, the possibility of expressing emotion, and an 
emphasis on interpersonal relations. Argyris admits that this kind of society is rare 
and at the present time is practically only found in the world of T-groups. What 
he does neglect is, of course, the fact that T-groups have their own norms, norms 
of expressing emotions and so on, which exert as much influence on the group 
members as societal norms, but in a different direction. 

Argyris’s two types correspond to similar divisions in other theories such as 
McGregor’s (68) X and Y and Likert’s (64, 65) authoritative and participative 
systems. It might be safe to say that these theorists are proposing two different kinds 
of societies or groups, two different kinds of norms by which it is possible for groups 
to function. The new set of norms, the Y or B or participative (system 4), whatever 
the new group is called, might be easier than the old to set up as an ideal and a goal 
for certain kinds of changes in group functioning. 

Even when theory is discussed purely in the abstract we can see the relation 
between description and analysis on the one side and action and utopian goals on 
the other. Where the social psychological tradition persists in treating current 
circumstances as invariable facts of human nature, the extreme on the other side is 
to treat goals and utopias as proven characteristics of viable and existing systems. 
The theory underlying intervention, therefore, is intimately tied up with its applica- 
tion to particular settings and purposes. 


MEANS AND ENDS 


We can now see more clearly the functions of group intervention. Small groups are 
the locus of intensive emotional interaction and are also a vehicle for achieving many 
aims in society. Intervention by group process can be used to make the group more 
efficient in pursuing its purpose, but intervention at this level and under these 
emotional conditions gives the process an added meaning beyond the group’s osten- 
sive purpose. We recall in this connection the original function of group intervention 
as part of religious ritual. 
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We can start with the ostensive goals of the intervention. These goals may be 
directed toward some influence on the individual members of the group, on the 
functioning of the group itself, or on the larger organization of which it is a member. 
In the first case the aim may be teaching skills to the members; it can be a technique 
to Introduce new methods in schools or professional training. Or it may aim to 
induce a deeper personality change, in which case it becomes an adjunct or even 
principal feature of psychotherapy. The last case is exemplified in the use of groups 
in personnel management. The group intervention method is used as a way of 
increasing the efficiency of a business, industry, or governmental or voluntary 
organization. In the extreme, of course, it might be used as a method of reconstitut- 
ing society, which is probably the approach some of the European, more politically 
oriented leaders are using. 

There remains the second case, the movement of the group into itself. Here, in 
a way, the emphasis on group process is shown in its purest condition. When the 
effects on the participants are relatively weak, the groups serve a kind of recreational 
function, which has given the “psych-resorts” such great popularity and mass 
recognition. When a stronger effect is sought, the groups may actually aim at 
becoming a reconstructed society or alternate life-style, such as was proposed by 
Argyris, Likert, and McGregor. It is here that the original connection of small 
groups with religious efforts becomes most prominent and most ominous. 

In all these situations, except perhaps the last one, there is a certain incongruity 
between means and ends. Strong emotions and attempts at involvement of the 
members are used for ends which are, after all, usually quite pedestrian or trivial. 
In fact, this incongruity is one of the principal difficulties in any evaluation of 
intervention techniques. It seems obvious that something happens to people during 
these intense emotional sessions and the immediate reaction of most people seems 
to confirm this. However, detailed investigation of long-range changes or effects 
which are congruent with the ostensive purposes of the intervention leads to ex- 
tremely ambiguous results. The more adequate measurement techniques are often 
used in studies which find the smallest changes (8). 

Scientists have also been concerned about the negative effects or the dangers of 
the group intervention techniques. Here again, little research has been done to define 
the dangers. As a matter of fact, the definition of danger within the groups has 
become a pointed issue. Perhaps following the religious model of the soul going 
through darkness to light, some encounter leaders do not take evidence of nervous 
breakdowns, psychotic episodes, and the like as evidence of bad effects of the 
program, but say instead that this is part of the development of the person toward 
a higher level (77, 88). In the same way, the group intervention programs in industry 
can lead some managers who are participants in the groups not to become better 
managers but to decide that work in industry is not for them. It can be argued 
whether this is really a deleterious effect; it cannot be argued, of course, that it is 
not what the particular industry paid for when it sent its workers to attend the 
group. 

Research on casualties will have to be done in a carefully controlled fashion, and 
consideration must be given to casualties other than merely those designated by the 
group leader. In fact, it has been shown that group leaders in general are the least 
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qualified persons to identify casualties (63). This, incidentally, is an interesting 
sidelight on the sensitivity which the leaders of sensitivity training have. Intensive 
follow-up studies of the kind used to look for positive effects are even more needed 
for assessing negative effects. However, most of the data on casualties as well as on 
positive effects are mainly anecdotal] and difficult to evaluate. 

Going beyond the questions of the effectiveness and danger of intervention 
groups, these groups provide an interesting site for studies of interaction, since in 
many of them the purely interpersonal aspects of interaction are heightened to a 
tremendous extent. Thus, much of the research on group intervention really has very 
little to do with the groups themselves, but uses them more as a real-life, real-time 
experimental situation for studying social interactions. 

Another kind of research which has not been sufficiently pursued involves looking 
at the intervention technique as a social phenomenon of our time. The existence of 
small intervention groups for their own sake is probably one of the more unique 
characteristics of our society. They have not prospered because they have been 
successful in achieving their stated aims, as evaluation research has shown, nor have 
they revolutionized methodology or brought about a new era of understanding of 
human behavior. Therefore, we have to look more at the social conditions surround- 
ing the people who lead and participate in groups of this kind. Little research has 
been done on this, but ideas for such research might include studying the recruit- 
ment of participants and leaders and assessing the importance of the group tech- 
niques in the general working of society. ` 

Thus from every point of view, from the history of development of the technique, 
from evaluation, research, and application, we are led to the relationship between 
small group intervention and the values of society. The change from the original use 
of the technique in early religious ritual to the secular, instrumental method of 
today; the experimentation with a new, explosive, important force within human 
relations and the tentative efforts to control it; the new search for peak experiences 
through group action—all these can be studied by investigating the group proce- 
dures, but they are also important in their relation to lasting cultural trends. The 
best way to understand the group intervention movement is to look at it simulta- 
neously as a psychological technique and a social problem. 
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The history of the area known as behavioral genetics (psychological genetics, 
sometimes shortened to psychogenetics) is, as mirrored in the pages of earlier 
volumes of the Annual Review of Psychology, but a short one. Fuller’s 1960 review 
(88), covering the period from 1955, showed that activity in the late 1950s was 
directed to practical human problems such as mental illness. A major technique in 
work with humans was the twin study; most animal work aimed simply to demon- 
strate genetic control, comparing strain, as the independent variable, with behavior, 
as the dependent. Crossbreeding of strains was done largely in the Mendelian 
tradition of trying to isolate single genes. Attempts to find gene-behavior pathways 
emphasized audiogenic seizures. Such work continues, but now a major part of 
psychogenetic research is biometric and quantitative. 

McClearn & Meredith in 1966 (161) observed that strain differences were com- 
monplace and to be regarded only as a starting point. Of major interest were 
advances in human cytogenetics which continue to provide insights. There were also 
some biochemical and single-gene studies. Most of the rat and mouse work reported 
were studies on strain differences, although selective and crossbreeding studies were 
more plentiful. 

In 1970 McClearn wrote a distinguished review for a sister volume, the Annual 
Review of Genetics (157), with a useful historical introduction, but the purely 
psychological story was not resumed until 1971 with Lindzey, Loehlin, Manosevitz 
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& Thiessen’s long and comprehensive review (150), which chronicled the further 
decline of simple strain comparisons and the increase of quantitative genetic analy- 
ses. The growth of studies using physiological analyses and the increasing interest 
in evolution were stressed. Meanwhile Childs (38) reviewed behavioral genetics from 
a viewpoint appropriate to the Annual Review of Medicine. 

The growing importance of the field is reflected in the increasing frequency of 
reviews which, according to the 1971 “master plan” of the Annual Review of 
Psychology, should appear at 3-year intervals. But increasing frequency means 
decreasing size, and the present review must be shorter than many of those men- 
tioned. Hence, in addition to maintaining an enforced selectivity, we have relied 
much more on previous reviews for background references than did our predeces- 
sors, using the five cited above to the exclusion of other earlier sources unless 
absolutely necessary. But their total of 1213 citations, with surprisingly little over- 
lap, makes them a valuable repository of information on which to base our current 
progress report, which reviews the literature from approximately late 1969 to early 
1973. 

Our organization is simple: the first section deals with work using infra-human 
animals as subjects, the second with work with humans. The animal section is 
organized broadly by methodology, progressing from relatively simple to more 
complex situations and designs, with a detour through a single phenotype, that 
relating to alcohol preference, as befits a topic putatively concerned more directly 
with real-life problems. This approach also governs the organization of the section 
on humans in terms of the traditional categorization of behavioral phenotypes, but 
it necessitates a terminal discussion of methodology. 


ANIMAL RESEARCH 


Reviews 


Topics include developmental problems (94), learning (257), audiogenic seizures 
(90), alcoholism (159), aggression (160), genetic homeostasis (120), and evolution- 
ary contexts (240, 269); a general review by Thiessen (239) provides a history of 
mouse strains. 


Major Genes 


Interest in major-gene analysis continues. Studies of neurological mutants in Droso- 
phila by Benzer and his co-workers have reached an exciting stage (see 150, p. 73; 
38, p. 380). By producing flies which are structural mosaics for genes controlling 
various abnormal behavior traits, including one named “drop dead,” they have 
succeeded in mapping gene location with reference to known marker structures, on 
the two-dimensional surface provided by the egg, with such precision that the origin 
of the defect can be pin-pointed (118). Thus in the case mentioned it has been shown 
that it is the brain and not any other part of the fly that must be involved. Analysis 
by this powerful genetic mosaic technique of other more complex behavior is prom- 
ised and will be eagerly awaited. 

In mice a study of the effects of four alleles differing in respect to tyrosinase 
activity at the albino locus showed that behavioral differences were unrelated to 
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either eye pigmentation or tyrosinase activity (241). When pigmented and albino 
segregants were tested for descent from a visual cliff using different lighting and 
platform heights, albinos were slower under all conditions (187). To discuss these 
findings, simply in terms of acrophobia added to a postulated photophobia is to 
minimize developmental history. A major gene which affects sensory receptors from 
birth must express itself in behavioral differences at maturity. Debate continues (23, 
268) over the efforts at conceptual clarification in this area (see 150, p. 72). 

Using a developmental approach, van Abeelen & Kalkoven (1) compared normal 
and Nijmegen waltzer mice on reflex and behavioral measures at several ages. Unlike 
major reflex differences which disappeared by weaning, behavioral differences per- 
sisted into adulthood. The deficient performance of retinal degenerate segregants on 
jump-out and one-way avoidance learning was shown to be a function of poor visual 
capacity, (259) whereas albino mice, if tested in dim red light, were not deficient. 
Albino segregants were progressively inferior to pigmented mice in a water-escape 
task (78). Since initial scores were identical, light intensity is unlikely to have been 
important. Mutations producing obesity in mice were detailed in relation to food- 
intake regulation by Fuller (89). 

Major-gene analysis has been approached through study of behavioral differences 
rather than mutation. Whitney (266) measured the tendency to vocalize among 
randomly bred mice, finding frequencies expected for a single locus with complete 
dominance. Collins (43) has now mapped the position of the audiogenic seizure- 
prone gene using this approach. 


Selection 


Progress in ongoing behavioral selection studies as well as new selections for behav- 
ioral characteristics, continues to be reported. The sophisticated selection experi- 
ment of DeFries and co-workers, in which mice were bidirectionally selected for 
open-field activity, exceeds ten generations (50). Realized heritability was reliably 
estimated at 13%. Because the selection lines (as well as controls) were replicated, 
it could be shown that the negative correlation between defecation and activity was 
not a fortuitously correlated response. Tests at So checked the situational generality 
of the selection response. Differentiation of strains by activity in boxlike and illumi- 
nated apparatus had elements in common with that in the open field. Activity in 
wheels did not differentiate the strains. 

Mouse strains bidirectionally selected for rearing did not differ systematically in 
adrenal or thyroid activity (2). However, Blizard & Chai (16), selecting for release 
of radioactively labeled iodine from the thyroid, found their strains differed in 
open-field defecation and water-maze learning. The results complement work on the 
Maudsley strains of rats selected for open-field differences and found to differ in 
thyroid funtion (see 161, p. 536). 

Lagerspetz & Lagerspetz (141) now report results to $,, of their bidirectional 
selection for aggression in mice, noting no change since S,, though strain differences 
in aggressiveness may be reversed by social experiences. Mice were selected over 
seven generations for learning activity in a T-maze (119). The authors’ analysis 
showed a small genetic component. In four generations of selection of mice for rapid 
water-escape learning, realized heritability was found to be low (78). 
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Roubicek et al (209) selected rats for high and low water intake for eight genera- 
tions at an ambient temperature of 22°C and for nine at 35°C. Selections for high 
intake at high temperature and low intake at the low were successful, but selections 
for increased intake at the low temperature and for decreased at the high were much 
less so. However, transfer of animals selected for increased intake at low tempera- 
ture to the hot environment produced an intake similar to that of the rats selected 
at that temperature, though the relationship between the two low intake lines under 
comparable conditions was less marked. This difference is reflected in the higher 
genetic correlation of intake in the two environments for the high lines compared 
with the low. No abnormalities in urine concentration, adrenals, pituitaries, or 
kidneys were found in these strains, which consequently should make good material 
for behavioral studies. 

A preliminary communication (26), which nevertheless reported 19 generations 
of bidirectional selection for shuttle-box avoidance in the rat, indicated an asymme- 
try comparable to that found previously in a similar experiment (see 150, p. 55). 

Imprinting is widely regarded as a biologically important behavior. Graves & 
Siegel (98) bidirectionally selected chicks over five generations and found asymme- 
try for response to selection favoring rapid approach, but not for slow approach. 
Three lines of domestic chickens, bidirectionally selected for mating behavior, were 
studied for the effects of androgen therapy on sexual behavior (162). Level of 
restored mating behavior depended on genotype. 


Alcohol Preference 


In an ongoing study, Eriksson (67, 69) bidirectionally selected for voluntary ethanol 
consumption in the rat. At S,, the low strain was more active in the open field. 
Examination of the metabolic differences between the strains (81) showed that the 
high alcohol strain had a higher brain serotonin content; the difference was more 
extreme after exposure to alcohol (6). This strain voluntarily drank to intoxication 
(68). 

Whitney, McClearn & DeFries (267) reconsidered earlier reports and presented 
new data on alcohol preference in rats and mice. They calculated various heritabili- 
ties, all consistently low, and questioned the assumption that heritabilities must be 
population specific. A study using two mouse strains and their filial and backcross ` 
generations has reported slightly higher heritability indices and considerable direc- 
tional dominance for nondrinking behavior (66). A similar analysis of morphine 
intake in the same strains encountered severe scalar problems (70). Good agreement 
with a two-gene model was found for preference and consumption in two strains and 
their filial and backcrosses, without demonstrated involvement of coat-color genes 
(91). Inter- and intra-strain differences in preferences in mice were also reported (71, 
194). 

Pursuing his search for underlying mechanisms, McClearn proposed a negative 
feedback model (158) and, with co-workers (223), reported that intraperitoneal 
injections of ethanol produced higher accumulations of blood acetaldehyde in a 
nonpreference strain. Odor has been implicated in ethanol aversion in BALB/c mice 
(178). 
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Three studies relate alcohol consumption to other behaviors in mice and rats. A 
factor analytic study related alcohol consumption to two factors, audiogenic reac- 
tivity and disorganized behavior (195). An analysis using filial and backcrosses 
found alcohol preference in mice to be negatively related to open-field defecation and 
urination (265). Higher voluntary intake, however, was found in strains of rats 
selected for high emotionality and high avoidance (212). At two symposia (80, 222) 
material on the value of animals in alcoholism research was presented. 


Simple Strain Differences 


Despite previous caveats, strain differences continue to be reported and some merit 
brief attention. However, the practice of using laboratory rodents “bought in,” Le. 
reared somewhere other than in the home laboratory, must raise doubts about the 
validity of the findings reported, depending, of course, on what is known about the 
phenotype investigated. 

Interest has arisen in measurements of brain biochemistry. Comparisons among 
rat strains have included telencephalic serotonin metabolism (207), acetylcholineste- 
rase (230), and tyrosinehydroxylase—a rate-limiting enzyme in the biosynthesis of 
brain amines—which was found to relate inversely to activity (221). Rick et al (204) 
found a similar association between activity and GABA. Studies using drugs affect- 
ing the cholinergic system supported the existence of a genetically controlled brain 
cholinergic mechanism which facilitates exploration in mice (3, 4). 

Several studies have been based on rat strains selected for behavioral characteris- 
tics including derivations of the Maudsley and Roman strains (see 161, p. 535, 150, 
p. 55 respectively). The most recent of a series (102) has confirmed strain differences; 
emotionality, as measured by suppression of drinking by unsignaled shock, was 
unrelated to defecation (121), as was spontaneous activity and avoidance condition- 
ing (211, 213). 

In further reports from Schlesinger and co-workers (see 150, p. 41, and 157, p. 
454), a heterogeneous and an inbred mouse strain were both found to be susceptible 
to audiogenic seizure priming, though optimum stimulus exposure varied between 
them (20). Substances known to affect levels of biogenic amines reduced susceptibil- 
ity to seizure after priming without affecting priming itself. 


Simple Genetic Analyses 


By this rubric, we mean analyses uncomplicated by environmental variables, though 
genetic designs may be quite complex. When three mouse strains and their first filials 
were compared on water-escape learning, the filials performed as well as the fastest 
inbred strain (77); in another study two strains and their first filials were compared 
for activity by means of two types of automatic recorders (191). When saccharine 
preference was compared in four inbred mouse strains, broad heritabilities were 
found to be high (192). Comparisons involving filial and backcrosses showed ex- 
treme heterosis for nest building in mice, in a study which also presented a diallel 
analysis (143). The pattern of differences between the broad and narrow heritabili- 
ties supports the adaptive significance of this character, interesting for its a priori 
relationship to biological fitness. 
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Van Oortmerssen (183), from an ethological analysis based on extensive filial 
crosses, has suggested that some laboratory strains of mice arise from surface- 
nesting wild progenitors while others derive from hole-nesters. Developmental 
differences in passive avoidance in two strains of mice and their F,s were shown to 
depend on shock level (233), and Hegmann (110) found strain differences in nerve 
conduction velocities which were faster in reciprocal F,s obtained between two of 
the six strains. Wahlsten (258) “bought in” strains for an interesting attempt to 
study mechanisms underlying response to shock and avoidance learning by equating 
shock level eliciting jumping. Multivariate analyses of behavioral sequences in two 
mouse strains and their filial and backcrosses showed evidence of segregation of 
sequence elements (148). 

Of two mouse studies based on parent-offspring covariation, one (245) demon- 
strated the absence of change m a measure of narrow heritability for runway 
learning, while the other (181) showed high narrow heritability for avoidance learning. 


Studies with an Environmental Dimension 


Of more interest than simple demonstrations of strain differences are recent attempts 
to show the limits of generality by utilizing more than one environment. For 
example, interstrain fighting in mice was shown to be dependent on infantile stimu- 
lation (198), and when male mice from two “bought in” strains were kept at five 
different temperatures and three different population densities to study the effects 
of these factors on aggression (100), the strain differences did not interact with 
environmental variables, An interesting limitation of generality arising from strain- 
measurement interaction was reported by Goodrick (95), who compared automatic 
and direct observations of open-field activity which varied differentially in two 
mouse strains. Differences in the effect of a companion on exploratory behavior were 
found in two strains of mice (227). Another social variable, litter size, was similarly 
studied (62). Factor analysis revealed dimensions interpreted as territorial marking, 
autonomic balance, acrophobia, and motor discharge. Although litter size directly 
affected the first two, the only interaction of litter size with genotype was in respect 
of motor discharge. 

Vale et al (251) “bought in” male mice of five inbred strains that displayed 
differences in population density effects for number of aggressive attacks and for 
adrenal weight. Genotype-density interactions were also found, but the relationship 
between behavioral and physiological measures was unsystematic. Scott et al (219) 
linked the effects of amphetamine on fighting in mice to differences in its effect on 
body temperature in two strains. Their findings relate to explanations of the aggrega- 
tion-toxicity effect of this drug which invoke body temperature. 

Investigation of nest building at two ambient temperatures (152) showed for one 
strain a negative correlation with food consumption at the lower temperature. 
Vocalization in infant mice was shown to depend on strain and ambient temperature 
as well as age (14, 180). 


Complex Processes, Including Genotype-Environment Interaction 


With this section we begin the consideration of what we see as the central problem 
in behavioral genetics. Psychologists quite rightly regard their discipline as having 
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analytical difficulties; geneticists view their area similarly. The intersection of inter- 
ests in behavioral genetics must therefore raise problems of considerable complexity, 
on either the genetic or the environmental side and often on both (25). Erlenmeyer- 
Kimling (72), deploring the lack of effort directed to the solution of these problems, 
blames the daunting nature of the task. We have necessarily concentrated on studies 
that emphasized these complexities, but we do not, of course, undervalue those that 
attempted to provide a framework on which to build further. Some of what has 
preceded may be regarded as falling into this latter category. 

Vale and collaborators, who recognize the importance of such a positive approach 
(see 150, p. 76), have presented a series of intriguing sex X strain X hormone 
interaction studies of aggressive and sexual behavior in mice (248-250). Their results 
have led them to suggest that sexual behavior may in fact be less dimorphic than 
aggressive behavior. 

Fulker (84) reanalyzed three studies—one on rats and two on mice—of prenatal 
and shock stress (244). Patterns of reciprocal F, differences yielded to a model in 
which the additive genetic effects were buffered by contribution from the maternal 
environment. 

Henderson has continued his systematic attack on problems in this area (see 150). 
In a 6 X 6 diallel study of mice, additive genetic variation in a complex food-seeking 
task was released by environmental enrichment, although the underlying genetic 
pattern remained essentially the same under both environmental treatments (113). 
A similar finding emerged for spontaneous alternation (111). Brain weight showed 
greater heterosis in the environmentally enriched group (112). In another diallel 
study on enrichment, Henderson found that it led to only small changes in discrimi- 
nation learning (114). These studies can be contrasted since aversive motivation was 
used in the second study. 

In attempting the analysis of gene-action in relation to environmental determi- 
nants, the various approaches adopted by Broadhurst & Jinks (24) open several 
possibilities. Their implications have been reviewed by Fulker et al (87), who pre- 
sented a preliminary multivariate analysis of an 8 X 8 diallel cross of rat strains, with 
infantile stimulation as an imposed environmental treatment. Genotype-environ- 
ment interaction was considerable in the additive components of variation in open- 
field measures, but minimal in avoidance conditioning. As in Henderson’s work with 
mice, directional dominance not interacting with treatment was found. More de- 
tailed analyses have revealed the subtlety of the interaction within avoidance acqui- 
sition (271) in that a progressive change in genetic contro! has indicated the 
possibility of two distinct behavioral processes underlying learning. Since they 
correspond to the two factors of Mowrer’s theory, it might be argued that behavioral 
genetics is beginning to have something of direct interest to say to one of its parent 
disciplines. 

Less systematic effects suggestive of similar mechanisms in mouse avoidance 
conditioning were found by Royce et al (210) in a 6 X 6 diallel, which also showed 
clear evidence of directional dominance for high avoidance. 

Genetic analysis of factor scores confirmed the different genetic substrates for 
those two frequently combined measures of emotionality in the mouse, urination 
and defecation. The results of two studies (171, 182) using mouse diallels and 


396 BROADHURST, FULKER & WILCOCK 


concerned with conditioning, maze learning, and activity were interpreted in part 
in terms of dominance relationships dependent on the particular crosses involved. 
But this epistasis (nonallelic interaction), evident in extensive biometric analyses, 
may be a function of the choice of extreme parental lines in relatively small diallel 
crosses. 

Several other small diallels in mice have been reported (30, 44, 83, 206, 247). The 
use of the variance-covariance diagram of Jinks for the proper presentation of the 
results is a welcome sign of an increasing awareness of the power of this method. 

As we have seen, Drosophila \end themselves to powerful analyses. The quantita- 
tive approach employs such behaviors as geotaxis (279) and locomotor activity, 
which Hay (107) used in a design employing all possible filial and backcrosses (14 
in all) to establish directional dominance for high activity in D. melanogaster. This 
study also incorporated an environmental variable (culture) and provided the first 
unambiguous demonstration of duplicate gene interaction and dispersed gene distri- 
bution for a behavioral trait. Such genetic architecture is typical of fitness traits 
which determine uniformly high levels in free-mating populations. For a measure 
of preening, additive and dominance maternal effects were found among females, 
whereas among males maternal and progeny genotype interactions occurred. Dial- 
lels gave supporting evidence. In further work (109) genetic factors were implicated 
in social interactions in flies. Manning & Hirsch (153) investigated genetic mecha- 
nisms in a strain of D. stimulans selected for slow mating. 

Population considerations often involve complex behavioral processes (see 150, 
p. 69); Ehrmann has now implicated olfactory cues in density dependent selection 
in D. pseudoobscura (63). In other Drosophila species similar effects are observed 
(64, 108), but frequency-dependent selection was not found in house flies (37). Using 
mice, Yanai & McClearn showed that females determine mating preference (282). 
In some strains they prefer to mate with males of different strains (281), and early 
exposure to adult males may play a part (283). An interstrain comparison of sexual 
performance in pairings forming a “diallel” pattern showed male rats’ success to be 
inversely related to success in the female (164), whereas in a study of mouse 
aggression using filial and backcrosses, restriction to pairing within generations only 
made interpretation difficult (264). In a true diallel cross of mouse sexual behavior, - 
Vale & Ray (246) reduced the problem by pairing uniformly with only one of the 
strains. Models for dyadic social interaction measurment in psychogenetics have 
been discussed (237). 

As we have seen, many of these quantitative studies make inferences from their 
results that bear on natural selection and evolutionary processes. It may be neces- 
sary to use behavior putatively related to fitness to substantiate such inferences (238, 
270). 

Study of the relevance of laboratory populations to wild populations is likely to 
be enhanced by biometric (85) and other (49, 177) techniques for investigating the 
genetic architecture and gene flow of the latter. Meanwhile, comparisons of wild and 
domesticated rodent species have been made (93, 199, 226), with some results 
suggesting that laboratory strains compare more favorably than previous reports 
have indicated (21, 22, 154, 200, 232). 
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HUMAN RESEARCH 


Reviews 


A comprehensive account edited by Kaplan (134), one on twins (172), and others 
on abnormal psychology (136, 224, 228) have appeared. 


Normal Cognitive Abilities 


General intelligence continues to receive most attention. Because early work in this 
area set high standards, many publications attempt reinterpretation and synthesis 
of already existing data. Burt (28), Jensen (126), and Vandenberg (253) have drawn 
extensively on earlier studies to conclude that differences in IQ in the populations 
surveyed are largely under genetic control: broad heritability was found to account 
for 70 to 80% of the variation, family environmental variation for about half that 
remaining, and random environment for the rest. Jinks & Fulker (131) reanalyzed 
data from five major studies using biometric methods (155) to investigate the genetic 
architecture of IQ. Distortions due to sampling and to both gene-environment 
correlations and interactions were judged absent; they therefore concluded that at 
least 22-100 genes controlled IQ, that narrow heritability was 71% (assortative 
mating accounting for 14% of this), and that dominance variation was 15%. The 
average level of dominance was 0.74 or more, dominance being for high IQ, and 
increaser and decreaser genes were found on the average to be equally frequent. A 
simple account of this analysis may be found elsewhere (86). Substantial dominance 
variation detected by three different methods tends to confirm Burt & Howard’s 
earlier finding (see 88, p. 56), while its direction conforms to the expectation that 
components of IQ have been of adaptive advantage during evolution. The influence 
of assortative mating was expected since the IQs of spouses correlate more than any 
other human behavioral trait (254). However, its influence has been most strikingly 
demonstrated by Eaves (56), who used the Reeds’ extensive pedigree data from five 
generations (see 150, p. 51) and showed that only genetic models incorporating 
assortative mating fitted the data successfully. 

Additional evidence (127) supports a fairly simple genetic model: genotype-envi- 
ronment interaction and genetic threshold effects were found to be absent, and all 
three fundamental components—phenotypic, genotypic, and environmental—were 
found to be normally distributed. 

Maternal effects may complicate any simple model: such effects, probably of 
intrauterine origin, are clearly indicated by the association between birth weight and 
IQ, recently confirmed for multiple births (201, 214). Willerman and associates 
(274) suggested postnatal maternal effects on the measured IQ of children of interra- 
cial marriage, but the results rest on weakly supported assumptions. A further study 
employing a similar approach with white subjects presented more convincing evi- 
dence (273). Children resembled their high- and low-scoring mothers more than 
their high- and low-scoring fathers. However, the failure of maternal effects to 
influence parent-offspring correlations differentially indicates their probably minor 
importance as a systematic source of variation. 
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Fascinating evidence is emerging concerning the developmental stability of a 
genetic component in intelligence. Wilson & Harpring (275, 276) report impressive 
results from administering the Bayley scales to 261 pairs of MZ and DZ twins at 
frequent intervals during their first 2 years. The profiles of the developmental lags 
and spurts, responsible for the typically low correlations at different ages, were very 
similar, especially for MZ twins. The figures suggest that heritable factors account 
for about half the variation in profile similarity and demonstrate that even the fine 
detail of some aspects of mental development as measured by the Bayley scale is 
under relatively tight genetic control. In a study of development of IQ from 3 to 
12 years using sibling-sibling and parent-offspring correlations, the average level of 
intelligence also appeared highly heritable, but not the pattern of development (156). 
However, these methods must be less powerful than the use of MZ twins. The design 
has also been criticized on other grounds (262). At the other extreme of age, a 
continuing study of senescent twins suggested that the genetic component in IQ 
remains high well into the eighth decade of life (124). Unfortunately, numbers were 
insufficient to warrant profile analysis. 

A large genetic component in IQ variation extending over a wide age range may 
have important consequences for human population structure. One possible conse- 
quence is that natural selection may be acting on IQ through differential fertility. 
Previous studies (see 157, p. 458) and a later follow-up (261) suggest that selection 
differentials are positive, tending to increase IQ. Selection may not be acting uni- 
formly throughout the full range of IQ (60, 76), however, and the possibility of 
dysgenic trends in some subpopulations should be noted (130). However, since there 
is good reason to believe, on both general evolutionary grounds and recent evidence 
(122), that the narrow heritability of fertility is low in some populations, there may 
be no long-term effects, whatever the size of the heritability of IQ and its correlation 
with family size. 

Another consequence of a large genetic component in IQ is its possible influence 
on class structure and mobility (58, 242). Differences in IQ between father and son 
correlate +.37 with differences in social position (260). Partial and multiple correla- 
tions were used to unravel the interrelated factors of educational level, social class, 
and IQ. Eckland (59) sees these intergenerational consequences of IQ as most 
important. 

Yet a third possible consequence is that genetic endowment might explain part 
of the 15 points average difference in measured IQ between white and black Ameri- 
cans. This claim by Jensen (125) provoked more debate than any other psychogen- 
etic issue of our times; timely press releases by the journal’s editorial staff ensured 
an instant controversy. Jensen’s is a thorough account of the genetics of IQ and 
scholastic achievement which he related to the results of compensatory education. 
These, he claimed, were inconsistent with existing environmental explanations. The 
paper was soon reprinted (126) with invited criticism and a reply (104, see also 105). 
A fascinating blow-by-blow account of the original publication and its early after- 
math can be found in Genetics and Education (129), which again reprints the article 
together with others and bibliographies relevant to the debate. Jensen’s considered 
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reply (130) argues that no plausible combination of environmental factors known 
to influence IQ, and which differentiate blacks from whites, seems capable of pro- 
ducing a mean difference as large as 15 points. He finds most consistent with the 
evidence the hypothesis that from one half to three quarters of the difference is 
attributable to genetic factors. While he convincingly demolishes most current 
environmentalist explanations, we may be simply suffering from a dearth of good 
research into environmental variables. None so far for which there is evidence 
available is deemed adequate, but that is a far cry from proving that none exist. 
Moreover, we rely on a weak form of proof when we rely on the mere consistency 
of a genetic model with the data. Whereas the model for IQ based on white popula- 
tions has been thoroughly tested, such is not the case here. We must avoid what 
Willerman (272) in a book review (73) calls “premature closure” (216). Diverse 
views on race and IQ may be found in (29, 188, 203), but for balanced views in small 
compass, we recommend Thoday & Gibson (243) and Cavalli-Sforza (33). 

The need for more research is seen by many; the question is, of what kind? A 
major difficulty is that inferences about the relative contribution of heredity and 
environment to differences between groups are not possible simply from a knowl- 
edge of their relative contributions within them. DeFries (48) has formulated the 
problem showing the unknown parameters involved. What other (less indirect) 
evidence could be gathered? 

Fostering studies have been suggested (19, 272), Certainly they produce most 
convincing direct evidence for the relative importance of genetic and environmental 
influences within subpopulations. The problem is that fostered children must carry 
their color across populations, unlike the class origins of such children within 
populations. But comparisons involving very young foster children would have 
merit. The study of interracial marriages is a related approach (274). 

A second direct method, which employs blood groups as a covert means of 
identifying the extent of mixtures of genes which typically differentiate blacks and 
whites, and correlates these differences with IQ, has been elaborated by Heston et 
al (see 130, p. 224). Using simpler methods, it was found (184) that such blood 
groups correlate with a measure of mental ability in blacks, though not in whites; 
a Monte Carlo simulation (185) showed that problems of statistical inference in- 
‘volved in stepwise multiple regression were responsible for the high correlation and 
new data failed to show the relationship. Loehlin, Vandenberg & Osborne (151a) 
reanalyzed the original data together with new, calculating separate correlations for 
black and white subjects between composite mental test scores and the presence of 
blood group genes. Rank correlation between these correlations themselves and the 
rank order of decreasing frequency among blacks of the 16 genes studied indicated 
no association between the extent of white ancestry and mental ability. Before 
concluding that there was no evidence for a genetic component in the black-white 
difference, they tested the efficiency of their method and found that genes in one 
blood group system, known to have relatively greater frequency in Europe, failed 
to predict the presence of genes of greater European frequency in other systems. 
They observed that genes that themselves are no longer associated cannot be ex- 
pected to be associated with other genes affecting IQ. Larger samples might help, 
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but the method does not look promising, especially in view of the very large overlap 
in the interracial frequencies. 

A third direct line of research is to seek genotype-environment interactions to 
explain the difference. Scarr-Salapatek (215) carried out a twin study involving black 
and white children using aptitude and achievement tests. She compared heritabilities 
in the different races and social class combinations, and suggested that heritability 
was low in groups disadvantaged by either race or class, thus supporting the idea 
that disadvantaged children do not reach their full potential. While this may be true, 
her data cannot support the conclusion. Eaves & Jinks (57) showed that the study 
suffered from methodological weakness: the conclusions do not even suggest a 
genetic component within races, let alone any interaction with race or social class. 
Barker’s rejoinder (12), insofar as can be judged, suggests that something may yet 
be salvaged! While Vandenberg (252) has reported a small twin study on black 
subjects’ IQ, we clearly do not know enough to suggest genotype-environment 
interaction in black populations. Again the prospect is not promising, for Jensen 
(128) failed to find convincing interaction between race, class, and sex, and evidence 
from white populations suggests no genotype-environment interactions for IQ (127, 
131). 

Turning from general intelligence to other congitive abilities, available evidence 
on school achievement (104, 130) implies that genetic components account for only 
about 40% of variation, half that for IQ. This is partly because common family 
environment accounts for some 50% compared to about 10% for IQ. Such differ- 
ences could explain why compensatory education affects attainment more than IQ. 

Does sex-linkage underlie observed sex differences (99) in many special cognitive 
abilities (144)? Polygenic sex-linkage is indicated by parent-offspring and sibling 
correlations, the pattern depending on sex of parent and of offspring, as originally 
worked out by Mather & Jinks (see 155, p. 291). Stafford, who reported this pattern 
for spatial ability (see 161, p. 526), found a similar pattern for mathematical ability 
and suggested a major recessive gene (236). Further data relating to spatial abilities 
have been reported (103), and Bock & Kolakowski (17) demonstrated a possible 
single recessive sex-linked gene operating against a background of environmental 
and autosomal variation. Combining a variety of spatial test 1tems to score 727 boys 
and girls on a normal ogive model of the test, they decomposed the distributions 
of scores for each sex separately into the two equal-variance normal distributions 
best fitting the data, equal variation being expected once allowance has been made 
for the effect of the major gene. They showed that their model strikingly fitted the 
data and applied equally well to parent-offspring correlations from their own and 
previous studies. Finally they noted that such a recessive gene might be more 
associated with fitness (rather than the more usual expectation of dominance), if 
natural selection acted primarily through the male. Evidence suggesting an autoso- 
mal linkage between a major gene for verbal ability and blood-group genes depends 
on the greater within-pair variance found for DZ twins discordant for blood groups 
(18). 

Other investigated phenotypes of interest include musical ability, which showed 
(235) genotype X instruction interaction; number estimation (101); and laterality 


BEHAVIORAL GENETICS 401 


(146), for which a model postulating one locus for cerebral dominance and one for 
relative hand control was shown to fit a variety of data (178a). Cultural and genetic 
factors could not be distinguished (140) for sensitivity to flickering light in part of 
a study of racial differences in Hawaii (40). 


Mental Retardation 


Given a large genetic component in cognitive abilities, it follows that much retarda- 
tion, other than extreme forms attributable to major genes or traumata, will result 
from normal polygenic segregation. Thus, using a narrow heritability of 70% for 
IQ and allowing for assortative mating, it has been shown (86) that a polygenic 
model predicts within 1% the frequency of subnormality from subnormal matings 
in the Reeds’ data. Combining the two approaches, Berman & Ford (15) predicted 
phenylketonurics’ IQ from those of their parents and siblings. Discrepancies were 
related to phenylalanine levels in the blood. The complexity of genotype-environ- 
ment interaction in this area is illustrated by a study which manipulated these levels 
by dietary regimens to find the optimal effect on IQ amelioration (92). 

Among other factors of possible methodological importance in the genetics of 
subnormality are sex-linkage and inherited maternal effects, already mentioned as 
additional complications of the polygenic model of normal variation. Lehrke (144) 
proposed that sex-linkage for several cognitive abilities might explain the greater 
variability of males, resulting in their greater contribution to the retarded as well 
as to the gifted. His arguments have been criticized mainly on statistical grounds 
(see 145), but the possibility remains. Maternal effects, probably mediated via inher- 
ited intrauterine inadequacy, were detected by Ahern & Johnson (5) in the Reeds’ 
data on IQ. Probably the best evidence for directional dominance for high IQ has 
come from Schull & Neel (see 150, p. 45), who, in their continued study of inbreed- 
ing in Japan (218), have shown effects of parental consanguinity on IQ and achieve- 
ment (179), which, while nonsignificant, reproduced those previously found. 
Retardation should thus result from severe inbreeding. Seemanova (220) writing on 
incest among first-degree relatives, who have a very high inbreeding coefficient (14), 
described an impressive series of 161 such relatives plus 95 half-sibling controls. 
Among the former, 40 showed moderate to severe retardation; among the latter, 
none. 

Kessler & Moos, reviewing chromosomal abnormalities (137), concluded that, 
with the possible exception of space-form deficit in Turner’s (XO) syndrome, such 
abnormalities are only associated with a general retardation, as in Down’s syndrome 
(10). Concerning major-gene effects, for those who curled up with McKusick after 
the last annual review, the third edition is a must (163). 


Normal Personality 


It is well established that most aspects of personality are at least under some genetic 
control. Eysenck (74), Vandenberg (255), and Shields (224) have reviewed the area. 

A methodologically elegant study is that reported by Hill & Hill (115, 116). It 
involved 97 married couples who had been tested with the Minnesota Multiphasic 
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Personality Inventory at school, thus ensuring no postmarriage biases. Twenty-eight 
of their children were then tested at the same age their parents had been, thus 
ensuring no age bias. Despite these small numbers, the results were judged free of 
common artifacts. Positive assortative mating was indicated for three psychoticism 
scales. A cross-correlation matrix yielded intriguing relationships such as the corre- 
lation (+0.3, on the average) of depression in wives with psychoticism in husbands, 
whereas the reciprocal correlations were essentially zero. Parent-offspring correla- 
tions for psychoticism scales, together with those for mania, showed greatest narrow 
heritability when corrected for assortative mating, as this design allows. 

The multivariate methods developed by Vandenberg and collaborators (see 150, 
p. 50) in the cognitive domain have been adapted by Eaves (55) to analyze genetic 
and environmental covariance among MZ twins tested on Eysenck’s PEN question- 
naire. The structure of the within-pair matrix, assessed using principal components, 
indicated environmental covariance structure. Genetic covariance was also investi- 
gated using a kind of canonical analysis. He concluded that environmental influ- 
ences determined variation in neuroticism and introversion-extraversion in a unitary 
manner, but several genetic variables determined psychoticism. A well-defined geno- 
typic factor also emerged for neuroticism, but not for introversion-extraversion. 
Another study employing a dimensional approach has been reported (41). 

The co-twin contro] method allows a control for genotype to permit a more 
accurate assessment of environmental influences. Developmental studies have used 
this method to relate personality differences to possible causal influences (8, 42). 
Babies better endowed became more secure and trusting infants and better coping, 
assertive toddlers. Wilson et al (277) found MZ twins were more alike than DZ twins 
in temper frequency, demands for attention, and crying; this difference between MZ 
and DZ twins was larger among babies than at later ages up to 6 years. Some of 
these findings were related to motor performance (276). Van den Daele (46) reported 
evidence of a genetically controlled regulation of infant responses to rocking and 
soothing sound; hypnotic susceptibility may be controlled similarly (174). In an 
extensive and massive series of twin studies in Sweden by Cederlöf and co-workers 
(see 35, 36), which previously attracted little attention, research is now focused (82) 
on twins discordant for mortality but concordant for smoking. Present evidence 
implicates factors other than smoking, such as accidents and suicide, in the greater 
morbidity found among smokers. Another co-twin control study (149) related dis- 
satisfaction with life to incidence of heart disease. 

In a small twin study of sexual behavior, Chilton (39) produced evidence that the 
age at which masturbation, orgasm, and other sexual activity began is heritable. 
Difficulties inherent in research into sexuality are apparent, but the study points to 
an important, neglected area. 


Abnormal Personality 


Some progress is being made in our understanding of schizophrenia, affective disor- 
ders, alcoholism, and the XYY syndrome; discussion here will be restricted to these 
topics. Two major books have appeared on schizophrenia: Kaplan’s work (133) 
brings together some 30 contributors on a variety of topics related to the genetics 
of schizophrenia, and Gottesman & Shields’ study (96) is based on the Maudsley 
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twin sample. Meehl’s afterword (167) appraises this as a “definitive work, a magnum 
opus, a beautiful book indeed.” It is also a gold mine, since it provides case histroies 
and diagnoses of all 24 MZ and 33 DZ twin pairs originally identified through an 
index case, These diagnoses were investigated in a cross-national study (225). Six 
diagnosticians, differing widely in their training and outlook, not surprisingly pro- 
duced different concordance rates. But consensus diagnosis maximized the heritabil- 
ity, restoring some faith in the diagnosis of schizophrenia. Psychometric 
implications have been discussed (96, 132, 167). While concordance rates for MZ 
and DZ twins have been falling recently, perhaps due to better sampling procedures 
(see 150, p. 64), consensus diagnosis restores concordance values to the region of 
46% for MZ and 14% for DZ (97), much closer to those in the older studies. Allen 
et al (7) also found more searching diagnostic methods increased concordance 
rates. 

New data from Iceland (135), based on public records, gave typically low con- 
cordance rates for many kinds of relatives (but not twins), and preliminary findings 
(13), have suggested that among children of two psychotic parents, the incidence 
of psychosis is about half that previously suggested. Inouye’s data on MZ twins 
separated before the age of 5 years (123) yield the same incidence as that for twins 
reared together, suggesting that between family environmental influences are small 
in determining schizophrenia (see 150, p. 63), but Kety, Rosenthal & Wender and 
co-workers (138, 208) confirmed earlier findings (see 38, p. 386) that incidence of 
broad-spectrum schizophrenia among adopted children of schizophrenic parents 
was as high as that among home-reared children, though the incidence of severe 
cases was much less. Perhaps severity is modified by home environment while 
disposition is not. The same research program produced evidence (263) that it is the 
downward social mobility of schizophrenics rather than prevailing cultural influ- 
ences, that produces the high incidence in low socioeconomic groups. 

Fischer (79) used an ingenious design to investigate the importance of home 
environment to children of parents themselves MZ twins discordant for schizophre- 
nia. The age-corrected risk for children of schizophrenic parents (9.5%) is not 
significantly different from the nonschizophrenic (12.3%), suggesting no obvious 
parental effects. However, the nonschizophrenic parents could hardly be regarded 
as a normal control group. 

Attempts to fit genetic models to data on concordance rates continue (see 150, 
p. 65). But recent attention has been directed to threshold models of liability 
developed by Falconer (75) and others (61, 231) and used by Gottesman & Sheilds 
(see 157, p. 439), who later point out the difficulty of distinguishing a one-gene from 
a polygenic threshold model (97). Kidd & Cavalli-Sforza (139) encountered the 
same difficulty, not surprisingly, since the polygenic model includes the case of a 
single gene as an approximate special case. However, Reich et al’s method (202) of 
multiple thresholds based on different degrees of severity should resolve the issue, 
provided that reliable population base rates for each threshold can be obtained, since 
this method has effectively distinguished between one-gene and polygenic models of 
animal phenotypes. Fulker (86) assessed gene action for liability, using a simple 
biometric extension of the threshold model, and detected additive variation but little 
or no variation due to dominance or between-family environment. 
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As others have pointed out (e.g. 150, p. 64), environmental factors in schizophre- 
nia must be very important, since MZ discordance is about 50%. But the absence 
of a strong between-family environmental effect and the marked discordance point 
to physiological differences, possibly arising early in life. Mednick’s prospective 
study (165, 166) of children at risk is one investigating this possibility. A distinctive 
premorbid condition is found in those adolescents who have suffered breakdowns, 
with birth anoxia implicated. 

In a study of discordant twins (234), biochemical variables have recently received 
most attention. Interpretation was complicated by long-standing drug treatments, 
but the lactate-pyruvic ratio involved in brain oxygen metabolism differed within 
discordant pairs. Pollin (196) discussed evidence bearing on his suggestion that 
catecholamine levels may be important in a physiological predisposition to schizo- 
phrenia. From the same major study, another possible predisposing marker in 
discordant pairs was reported (280) in the form of reduced platelet monoamine 
oxidase activity. Findings from a new, prospective high-risk study by Anthony (9) 
delineate five premorbid syndromes. 

Interest in population aspects of schizophrenia continues. Moran (173) discussed 
the role of heterozygote advantage and mutation in maintaining a high level of 
schizophrenia in the population. Lane (142) discussed the possibility that fertility 
may be even greater than previously supposed, especially among female schizophre- 
nics. 

Research on affective psychoses continues to be dominated by Perris’s (see 38, p. 
388) suggestion that unipolar (depression only ) and bipolar (depression with mania) 
disorders are under two distinct forms of genetic control. Winokur (278) showed 
that when only bipolar cases were considered, evidence of a dominant sex-linked 
gene was striking, a father-son concordance of zero completing the expected familial 
pattern exactly. Mendlewicz et al (168) associated bipolar depression with color 
blindness in the pedigrees of seven families, again suggesting sex-linkage, but unipo- 
lar depression (11) more clearly followed a polygenic pattern of inheritance, since 
the disorder occurred with equal frequency in male and female lines. Perris, who 
originally favored a dominant sex-linked model for the unipolar disorder, now 
favors polygenic inheritance for both (193), as do Slater et al (229); thus Winokur’s 
findings supporting sex-linkage for the bipolar disorder are atypical. Some bipolar 
fathers have bipolar sons which cannot happen with sex-linkage alone. Evidence of 
either heterogeneity of gentic control or a threshold mechanism for both bi- and 
unipolar depression has been presented (169, 197). 

Questionnaire studies have established a genetic component in alcohol use (151, 
190), but Schuckit et al (217) have presented the first clear evidence of a genetic 
factor in its abuse. Half-siblings were used together with objective criteria of alcohol- 
ism and other psychiatric disturbances were avoided. The frequency of alcoholism 
in half-sibs with an alcoholic parent who lived with the child was 46%, compared 
with 50% if they did not. When there was no alcoholic parentage, but the child was 
reared with an alcoholic step-father or mother, the figure was 14% compared with 
8% where the step parent was not alcoholic. The appropriate comparisons clearly 
indicate that genetic factors are important and that the mere presence of an alcoholic 
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in the home unimportant. Good evidence of assortative mating for alcoholism and 
associated psychiatric disorders has been found (205). 

The effects of sex chromosome aberrations on personality continue to be discussed 
(see 38, p. 394; 137), but most attention is focused on the XYY syndrome and its 
possible link with aggressive and criminal behavior. Two excellent reviews (117, 
186) summarize and evaluate available data, the latter review covering a wealth of 
material on physical and developmental abnormalities. The controversy concerning 
the high frequency of XYY individuals in mental and penal institutions is far from 
resolved. Owen concluded that rates in the majority of either mental hospitals or 
prisons are no higher than reliable population rates. But Hook found that for 
institutions of a kind reserved for the criminally insane, rates are from 5 to 20 times 
higher than those in the general population. This discrepancy is partly due to the 
different population rates used and partly to Hook’s rejection of lower rates in 
samples previously screened for abnormal height, but mainly to the different institu- 
tions selected for comparison. Some association between XYY and such special 
institutionalization remains a distinct possibility, although Hook suggests the risk 
is slight. Moreover, both reviews, in line with Kessler & Moos’ earlier review of this 
area (see 150, p. 68), failed to reveal any consistent association with personality and 
behavior. Hook plans a study of 11,000 young males whose sex chromosome compo- 
sitions are to be ascertained at birth. 


Methodology 


This final section will review briefly some methodological developments. Three texts 
on statistical genetics are noteworthy in this connection (34, 45, 155), and some 
work already discussed will be cited again. 

The study of twins will continue to form an important part of most research 
designs. Their uses and limitations have been ably described by Vandenberg (255), 
who also has provided new evidence on intrapair attitudes. Bulmer’s (27) excellent 
book is indispensable to understanding the biology of twinning. 

A general approach to partitioning variation in individual differences in behavior 
has, it is argued (131), certain advantages over others. Assumptions are tested and 
components of the model used are estimated with simple but efficient methods. 
Components can then be validly used to explore genetic control, using conventional 
methods of biometric genetics. The approach is flexible and can be extended to the 
multivariate case.(54). Eaves has also discussed the requirements of genetic design 
in terms of the relationships-twins, siblings, and other family constellations— 
involved (51, 52) and the sample sizes needed (53) if significant results are to be 
expected. Morton (175) described a number of powerful biometric methods applica- 
bie to psychogenetic problems and noted that forms of multiple regression are the 
obvious ways to identify behavioral patterns associated with single genes. With 
many genes and many sets of behavioral phenotypes, the natural extention is gener- 
alized canonical correlation, for which satisfactory methods now exist (31). 

Haseman & Elston (106) have described a powerful method for detecting linkage 
between a major marker gene and either major-gene or polygenic variation by using 
siblings and regressing the square of the sib difference onto the estimated likelihood 
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of carrying the marker. Pakstis et al (189) used this method to detect linkage 
between activity measures in children and blood-group markers. Elston & Stewart 
(65) extended the model to cover more complex genetic situations in which extensive 
pedigree data are available (see also 176). The threshold models (231) which have 
been used in schizophrenia research may be extended to the biometric analysis of 
polygenic inheritance (86) or treated as special cases of complex segregation analysis 
(175). In the latter case the multiple threshold model of Reich et al (202) should 
help distinguish single-gene from polygenic models and thus resolve the protracted 
debate about the genetic determination of schizophrenia. 

Morton (175) is skeptical about genetic analysis where genetic and environmental 
influences are correlated, whereas Cattell developed MAVA, multiple abstract vari- 
ance analysis, for Just such situations (see 88, p. 55). However, it seems likely that 
certain limited models may be possible (131, 170). Cattell’s (32) major recent contri- 
bution has been a heroic extension of MAVA attempting to separate the genetic and 
environmental effects of both maturation and learning. To accomplish this would 
require employing MAVA to partition both independent and dependent variables 
within a number of subpopulations. Correlating appropriate pairs of components 
across such subsets would lead to a regression equation for estimating genetic and 
environmental scores for individual subjects, which could then be examined for the 
effects of maturation and learning. 

The opportunity to test more complex and specific genetic hypotheses is now 
within reach. Methodology is now well worked out and theory is far ahead of 
experimental endeavor. We foresee a period of concentration on gathering empirical 
data which can then be consolidated into a comprehensive structure of knowledge. 

We cannot conclude this review without mentioning Darlington’s book (47), 
which, in the opinion of at least one of us, may prove to have been the most 
important citation of all. 
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Social and cultural influences have been studied in relation to four distinct though 
related aspects of psychopathology: its diagnosis; its occurrence and distribution; 
public attitudes; and treatment. A chapter with the title above, new for the Annual 
Review of Psychology, might well contain sections on all of these topics (cf. 18, 22, 
51, 59, 74, 132, 171). Such inclusiveness, however, would require far more space 
than is available. Our first problem, therefore, is to find a sensible basis on which 
to narrow the focus. 

The key questions about the role of social and cultural factors in the occurrence 
and distribution of psychopathology, and in relation to attitudes and to treatment, 
reflect underlying unsolved problems of definition and etiology. Of these three areas, 
it is the occurrence and distribution of psychopathology that we know most about 
and which holds, in our opinion, the most important clues to understanding the role 
of social and cultural factors in etiology. It is here as well that problems of how to 
define, measure, and diagnose various types of psychopathology present themselves 
in the most comprehensive form. Given the need to be selective, these considerations 
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seem decisive to us. The focus of this review, therefore, will be research on the 
occurrence and distribution of psychopathology in general populations. 

We can further rationalize our choice by noting that interested readers of the 
Annual Review of Psychology are likely to be familiar from previous volumes with 
much of the relevant literature on relations between social and cultural factors and 
treatment. They will have heard, for example, of the important research by Hollings- 
head & Redlich in the 1950s on social class and types of treatment (77) and the 
various replications, extensions, and related studies of other researchers over the 
years since. Moreover, they have available two unusual chapters that are extraordi- 
nary in the ways they contrast with and complement each other: one by Howard 
& Orlinsky in 1972, portraying and analyzing the social and cultural contexts of 
psychotherapy in the United States (78); the other by Cowen in 1973 on Social and 
Community Interventions (25). This material could be usefully supplemented, how- 
ever, by more extended consideration of controversies surrounding what has come 
to be known as the labeling or societal reaction approaches (e.g. 64, 150) and by 
studies of mentai hospitals as social systems (e.g. 19, 58, 141, 160, 173). 

Looking for discussion of research on public attitudes towards psychopathology 
in previous volumes is like searching for a needle in a haystack. Such attitudes, as 
we are now only too aware from what has become known as “the Eagleton affair,” 
can affect the destinies of candidates for high political office as well as the course 
of more usual careers. They are important determinants of who is dealt with by what 
agencies for what problems (cf. 78). It is fortunate for our rationalization, therefore, 
that recent reviews by Rabkin are now available to make up for our omission of 
research on public attitudes from the present chapter (139, 140). 


INTRODUCTION TO RESEARCH ON THE OCCURRENCE AND 
DISTRIBUTION OF PSYCHOPATHOLOGY 


A Brief Review of the Annual Review of Psychology 


Volumes of this Annual Review series contain only brief and scattered references 
to research on the occurrence and distribution of psychopathology in general popu- 
lations. Up to 1958 these consisted for the most part of citations of the early 
publications of Hollingshead & Redlich and their colleagues on rates of treated 
psychiatric disorder in New Haven (77). 

The first and only subsection of a review with the title “epidemiology” was in the 
1958 issue when Garner, after raising some methodological questions, commented 
thus about the Midtown Manhattan study: “. . . the very magnitude of the. . . effort 
ensures its influence as a stimulus to further methodological advance in this area” 
(53, p. 412). 

A year later Blake & Mouton suggested in their chapter on personality that 
research on “the prevalence of mental illness and milder failures of adjustment” 
contributes to an “awareness of widespread tensions, frictions and strains of modern 
living” (13, p. 226). The reason for this suggestion became abundantly clear with 
the publication in 1962 and 1963 of monographs reporting the main findings from 
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the Midtown Manhattan study (159) and from an investigation that is in some ways 
its rural counterpart, the Stirling County study (100). These monographs detailed 
conclusions, anticipated in the more preliminary reports in journals, that rates of 
psychopathology in the communities studied, including untreated as well as treated 
cases, were extraordinarily high: almost a quarter of the adult population in Mid- 
town and over half the adult population in Stirling County. In his 1965 chapter on 
Psychotherapeutic Processes, Matarazzo expressed skepticism about the magnitude 
of these rates (113). Nevertheless, the Midtown study and Stirling County study 
monographs were listed by Zubin in his chapter on Classification of the Behavior 
Disorders as among the “outstanding books” appearing in the 1960-1965 period 
covered by his review (179), and there is recurrent reference to them, usually in 
terms of the great unmet need for psychiatric treatment that they portray, the most 
recent in Cowen’s chapter in Volume 24 (25). 

We found no references to other epidemiological studies of the “true” prevalence 
of psychiatric disorders in our review of the Annual Review of Psychology through 
1973. As the reader will see, however, there are indeed other studies. Moreover, their 
main importance for our problem, like the importance of the Midtown and Stirling 
research, 1s less in any individual study than in the cumulatively consistent relation- 
ships that they report between social and cultural factors and psychopathology. 


The Inadequacy of Studies Based on Treated Rates 


In contrast to the Midtown and Stirling County studies, the majority of investiga- 
tions of relations between social and cultural factors and psychopathology have 
defined psychopathology in terms of admission to psychiatric treatment. Although 
operationally the clearest definition of a “case,” this is also the one with the most 
ambiguous implications. Since treatment rates vary with the availability of facilities 
and with public attitudes toward their use (e.g., 43,48, 80, 84, 125), either of these 
conditions may account for correlations between social and cultural factors and 
rates of psychopathology measured by number of cases in treatment. Thus, when 
the issue is the possible significance of social and cultural factors in the etiology of 
psychopathology, correlations involving only treated cases may be seriously mis- 
leading. Our own analysis of the evidence bearing on this problem has led us to 
dismiss from consideration for the most part studies dealing solely with treated rates 
(33). 


The Studies of “True” Prevalence 


In 1855, Edward Jarvis, a physician in the state of Massachusetts, submitted a report 
on what was probably the most complete and influential attempt to investigate the 
true prevalence of psychopathology conducted in the 19th century (80). This study 
was done almost half a century before the beginning of the “Kraepelinian era” in 
psychiatry (119, pp. 419-89), and Jarvis’ main nosological distinction was between 
“insanity” and “idiocy.” His procedure was to conduct an incredibly thorough 
survey of general practitioners, supplemented by reports of other key informants 
such as clergymen where sufficient information from general practitioners was not 
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available. These informant reports were checked against the records of mental 
hospitals and other official agencies and the resulting data analyzed according to 
such demographic variables as sex, place of birth, and economic status. 

Jarvis believed that only by analyzing statistics resulting from such research “can 
we come to the true causes of insanity, and the real influences that remove it” (80, 
p. 43). Some of the issues he raised on the basis of his findings are vivid forerunners 
of the major issues of explanation that we face today. 

Since the turn of the century at least 58 different investigators or teams of 
investigators in different parts of the world have attempted to count not only the 
treated cases but also the untreated cases of psychopathology in the communities 
that they studied. Excluded from consideration are studies in which the investiga- 
tors have reported scores measuring symptomatology but have not indicated what 
is to be considered a case (e.g. 28, 68, 88, 95, 128, 129, 133). 

Typically, the whole population in a specified geographical area is included in 
each of these investigations. Where studies have relied on sample estimates, the ns 
have usually been large: for example, the Midtown study of a probability sample of 
1660 adults in a section of New York City (159) and the Stirling County Study of 
1010 adults sampled on a probability basis from a rural maritime county in Canada 
(100). 

In most of the studies the rates that we have extracted represent prevalence (the 
number of cases existing regardless of time of onset) during a period of a few months 
to a few years. In four exceptions the investigators presented lifetime prevalence 
rates, and the Stirling County researchers refer to their estimates of “total reportable 
prevalence” as something more than current prevalence and something less than 
lifetime prevalence (100, p. 118). In addition, one study (71) has provided 10-year 
incidence rates (the rates of new cases arising in a 10-year period). By and large, 
these differences in type of rate employed appear to make less difference in the 
results than a number of other factors that will be discussed below. 

In the sections that follow, we will update the coverage of this literature that we 
published in our 1969 monograph (33) by adding to the 44 studies covered there 
the 26 studies that we have discovered since (1, 7--9, 27, 52, 69, 76, 81, 85, 90, 91, 
102, 110-112, 117, 118, 136, 151, 153, 164, 166, 167, 176, 178). We will also extend 
our analyses of what these epidemiological studies and related research have to tell 
us about relationships between social and cultural factors and psychopathology. 

In this review we shall mean by “psychopathology” the types of problematic 
behaviors described in diagnostic manuals such as the DSM-II (4) as operationalized 
by the various epidemiological investigators. Moreover, wherever the data permit, 
we shall restrict the focus to nosological types with no known organic basis—the 
types of behaviors for which the etiological significance of social and cultural factors 
is at issue. This restriction limits us very little, since the etiology of most of the 
psychiatric disorders is still largely unknown. Even for schizophrenia, there is 
controversy about the magnitude of the demonstrated etiological role of genetic 
factors (cf. 103) and no comparably persuasive demonstrations of any role for 
physiological (cf. 162), biochemical (cf. 122), or social and cultural factors (cf. 147, 
p. 426). 
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The legacy that the epidemiological studies of “true” prevalence have left us 
comes in two parts. One is a Pandora’s box of problems with regard to case definition 
and measurement. The second part of the legacy stands out sharply against this 
background of methodological problems. It is a set of relationships, highly consis- 
tent from study to study, between various social and cultural factors and certain 
types of psychopathology. 

Space limitations require that we present summary tables only in the following 
analyses of these two parts of the legacy. The detailed bibliography and tables of 
rates on which these summaries are based can be obtained by interested readers 
according to the procedures described in footnote 1. 


THE METHODOLOGICAL PROBLEMS 


In the large majority of investigations, cases were defined by applying clinical 
judgment to symptom data collected mainly on the basis of personal interviews with 
subjects. Two general approaches to securing these judgmental evaluations have 
been used. First, in most of the European and Asian research, a single psychiatrist 
or a small team headed by a psychiatrist personally interviewed community resi- 
dents and recorded diagnostic judgments on the basis of these interviews. As a rule, 
the interview procedures were not made explicit in this type of approach (e.g. 7, 71, 
85, 101). 

In the second type, by contrast, standard and explicit data collection procedures 
were used. Although the interviews were sometimes done by psychiatrists and 
clinical psychologists and sometimes by lay interviewers, in all instances case identi- 
fication depended on psychiatrists’ evaluations of protocols compiled from the 
interview responses and, sometimes, from ancillary data from key informants, offi- 
cial records, and interviewers’ observations. The Midtown study and the Stirling 
County study referred to above pioneered this approach, and some recent studies 
have adopted their procedures (54, 142, 153). This approach is relatively economical 
when, as in the Midtown and Stirling County studies, lay interviewers rather than 
clinicians are used to collect the data to be evaluated. More important, the written 
protocols that form the basis of the evaluation can be edited to remove direct clues 
to social and cultural background factors such as social class that could bias the 
clinical judgments (cf. 137). 

On the other hand, the procedure has a distinct disadvantage since it makes a 
difference whether the clinicians see and talk to the subject directly; the evidence 
is that clinicians tend to be overimpressed by pathology when working from written 
records alone (16, 36, 63, 152). The question of what information is required to make 
an informed clinical judgment and the question of what information is likely to bias 
such judgment are two horns of a dilemma that has received considerably less 
attention in the epidemiological studies and in the field of clinical assessment in 
general than it deserves. 

Even more economical than having clinicians rate protocols constructed from 
data collected by lay interviewers is dispensing with clinical judgments altogether 
and using objectively scored measures of psychopathology. A growing number of 
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investigators have been doing just that (e.g. 27, 117, 134). Unfortunately, however, 
their decisions to use objective rather than judgmental measures have been based 
on considerations of practicality and cost rather than on tests of the relative ac- 
curacy of these mechanical vs nonmechanical modes of combining data (115, 149). 

The objective measure used most often is a 22-item screening instrument devel- 
oped by the Midtown study researchers on a purely actuarial basis to provide an 
approximation of their Mental Health Rating of psychiatric impairment (94). A 
similar although less widely used measure consisting of 20 Health Opinion Survey 
questions has been developed by the Stirling County study researchers as well (106). 
Both have as their core a portion of the items from the Psychosomatic Scale of the 
Neuropsychiatric Screening Adjunct, developed as an aid to Selective Service 
screening during World War II (161). 

As would be expected from the source of many of them in the NSA, almost half 
of the questions in these instruments are physiological in content while the others 
appear on their face to touch on aspects of anxiety and depression. They are more 
relevant to aspects of neurosis than to psychosis or personality disorder. Such 
structured questions with fixed alternative response categories are easy to administer 
and have been subject, unlike the judgmental measures, to a considerable amount 
of methodological investigation. As a result, they have their proponents (e.g. 152, 
156, 164) and their critics (e.g. 26, 28, 30, 32, 108, 135, 136). 

Their proponents are impressed that these instruments have been shown to be 
reliable, to relate strongly to a host of demographic variables, and to discriminate 
between nonpatients and psychiatric patients, especially when a large portion of the 
latter were diagnosed as neurotic. Their critics worry about their lack of face validity 
as a representative sample of the range and variety of symptoms described in 
diagnostic manuals, their tendency to overrepresent distress in some groups, the fact 
that the more physiological items among them are confounded with physical illness, 
and their susceptibility to response styles—in short, an all too familiar catalog of 
assets and liabilities for personality measures constructed on actuarial bases. 

The application of these various judgmental and objective measures have led to 
dramatically different estimates of the “true” prevalence of psychopathology in 
different communities. In some communities, rates of 1 percent or less have been 
reported and in others rates of 50 percent and more. What could account for 
variability of this magnitude? 

Table 1 shows the medians and ranges in total rates reported for the communities 
studied, grouped according to geopolitical area and according to whether the study 
site was rural or urban. The range in rates within areas is in most cases so much 
greater than the differences in the medians between areas that these contrasts in 
setting do not appear to account for the variation from study to study. 

As we have shown in previous analyses--and there is no reason to change our 
earlier conclusions on the basis of the studies now added—the main sources of the 
variability between studies have to do far less with substantive factors than with 
methodological factors such as thoroughness of data collection procedures and 
contrasting conceptions of what constitutes a case (33). Let us illustrate the manner 
and scope of the impact of these factors on the rates reported. 
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Contrasting Conceptions of What Constitutes a “Case” 


As readers of Annual Review chapters on Abnormal Behavior and Classification 
of the Behavior Disorders over the years must be aware, there has been “a clamor 
against established categories of behavior disorder” (172, p. 290); “The present 
status of the classification of behavior disorders is, to say the least, chaotic” (179, 
pp. 375-76); and the field of behavior disorders is in “a period of ferment, contro- 
versy, and innovation” (137, p. 447). It should come as no surprise, therefore, that 
the single largest source of variability in the rates reported in these epidemiological 
studies is a function of contrasting conceptions of what constitutes a “case.” 

The great expansion of psychiatric nomenclatures following World War II affords 
a vivid opportunity to show the impact of contrasting conceptions of psychopa- 
thology on the results of the epidemiological studies. As Raines noted of the clinical 
experience of psychiatrists in the military in his foreword to the 1952 Diagnostic 
and Statistical Manual of the American Psychiatric Association: 


Only about 10% of the total cases seen fell into any of the categories ordinarily seen in 
public mental hospitals. Military psychiatrists, induction station psychiatrists, and Veter- 
ans Administration psychiatrists found themselves operating within the limits of a nomen- 
clature specifically not designed for 90% of the cases handled (3, p. vi). 


The 1952 DSM was a direct outgrowth of these experiences and its 1968 revision 
(DSM-II) (4) confirms the expansion and includes modifications which bring the 
American Psychiatric Association’s official nomenclature more nearly into line with 
that of a somewhat modified World Health Organization nomenclature (157). 

We would expect on the basis of these changes in nomenclature to find the rates 
reported in epidemiological studies published before 1950 to be much lower than 
the rates reported in studies published in 1950 or later. 

As Table 2 shows, the median rates for the more recent studies are far higher than 
those for the earlier ones. Moreover, the factor of directness of contact 1s dwarfed 
in importance by contrast with the time the study was done. 

It is tempting to see the results in Table 2 as reflecting not so much changes in 
the researchers’ conceptions as representing a true increase in the amount of disor- 
der as a function of the stress of modern life. However, this explanation flies in the 
face of what we know about the direction of expansion of nomenclatures over time. 
Most persuasive of all, it is inconsistent with internal evidence from epidemiological 
studies in which investigators who applied more and less inclusive standards to the 
same population obtained dramatic contrasts in rates. Essen-Möller, for example, 
reported a rate of 13.6 percent for “diagnoses constituting the main subject of most 
psychiatric populations studies” (44, p. 95) but an average of 54.7 percent (calcula- 
tion by B. S. Dohrenwend) for whom pathology was not definitely absent. A similar 
contrast is offered by the figures 23.4 percent in the impaired group and 81.5 percent 
judged less than “well” in the Midtown study (159, p. 138). The Leightons and their 
colleagues reported that over 50 percent of the sample studied would be cases as 
defined by the 1952 Diagnostic Manual (100, p. 121), but for only 3 percent would 
psychiatric attention be judged “ ‘mandatory’ according to current notions of medi- 
cal care” (100, p. 143.) 
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Furthermore, the judgmental measures developed have not always been easy to 
export. For example, the Leightons and their colleagues concluded concerning their 
own studies in Canada and Nigeria that “the differences and similarities between 
the Yoruba and Stirling figures are to an unknown degree under the influence of 
differences in the procedures employed in the two studies” (99, p. 124). And when 
D. C. Leighton rated a subsample of the data collected by another team of research- 
ers who had attempted to use the Stirling County rating procedures, the proportion 
she rated as cases was considerably lower than theirs (153). 

Results with objective measures show similar trends. As Langner demonstrated, 
the proportion among respondents psychiatrically evaluated as “impaired” in the 
Midtown sample who were identified as cases by the 22 item scale varied markedly 
with the cutting point employed on the screening measure (94). Paralleling his 
results are reports of case rates of 27.5 percent in a New Hampshire sample using 
a cutting point of four or more symptoms (134); 10.8 percent in a sample from three 
Plains State communities with a cutting point of 7 or more (117); and 3.4 percent 
in Kalamazoo County, Michigan when a cutting point of 10 or more symptoms was 
used (109). 


Problem of Validity 


Given the great variability in rates resulting from these differences in concepts and 
methods, the question arises as to whether the procedures used in some of the studies 


Table 2 Medians and ranges of percentages of psychopathology reported 
according to directness of contact with subjects and date of publication 


Date of publication 
Directness of contact Before 1950 1950 or after 
with subjects 
Indirect contacts: Median 2.3 3.0 
records and/or Range 1.1-9.0 1.2-14.8 
informants Number of rates 5 6 
Partial direct contact: 
records and/or Median 1.3 9.9 
informants, and Range 0.8-6.4 1.4-28.6 
interviews with Number of rates 5 9 
some subjects 
Direct contact: Median 3.6 18.2 
Interviews with Range 2.7-12.4 0.4-69.0 
all subjects Number of rates 6 40 
Interviews with physical Median = 18.0 
examinations of all Range ~ 10.9-64.0 


or most subjects Number of rates 0 7 
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are better than the procedures used in others. If so, we would be able to concentrate 
on the superior studies in our investigation of relations between social and cultural 
factors and psychopathology. This question is one of evidence—evidence for reliabil- 
ity and validity. 

Clearly the methods are unreliable for comparisons of rates across studies, as the 
results in Tables 1 and 2 show. Within studies, however, the evidence suggests that 
the assessment procedures—objective or judgmental—are of satisfactory reliability 
for research purposes. The Stirling County and Midtown researchers, for example, 
reported quite high agreement among the psychiatrist-raters for the main ratings 
used (100, 159). Moreover, more recent research supports the proposition that when 
there is a common data base, reliability of clinical judgment is not as difficult to 
achieve as had sometimes been supposed from work where nonstandard data bases 
have been employed (cf. 24). A far more serious problem is the validity of the 
assessment procedures. 

In most of the studies based on psychiatric judgments as to whether subjects were 
cases, the validity of the results was assumed to be implicit in the diagnostic process, 
a shaky assumption in light of documented lack of success with psychiatric screen- 
ing in World War II (55). The tacit request by the researchers that the validity of 
their findings be accepted on the basis of faith in the clinical training and experience 
of the clinicians participating in these studies is characteristic, by and large, even 
of the relatively more sophisticated investigations such as the Midtown study and 
the Stirling County study. In general, with the possible exception of an incompletely 
reported Polish investigation (57, 138) and a study conducted by Hogarty, Katz & 
Lowery (76) that was carefully designed on an actuarial basis for a specific practical 
purpose, there is little evidence for content validity, criterion-oriented validity, or 
construct validity in any of these studies. We presented a detailed analysis of the 
problem in 1969 (33), and although considerable methodological research is now 
underway, it has been too preliminary thus far to lead to major innovations in the 
epidemiological studies that we have discovered since that review. For example, a 
computer program now being developed to make the Stirling County study assess- 
ment procedures completely explicit has not been used so far as we know in any of 
the studies that have modeled themselves on the Stirling County approach (124). 

Psychologists familiar with methodological advances in the field of personality 
research in general could well be appalled at the lack of sophistication of these 
epidemiological studies. After reading chapters in past volumes of the Annual 
Review of Psychology on personality, abnormal behavior, and related matters in 
preparation for this review, however, we are inclined to temper our criticism. These 
reveal a “dismal overall picture” in personality research (89, p. 307); that there is 
a proliferation of un-or semi-validated measures (13, p. 207); that “most measures 
used to tap presumably stable personality characteristics have been attacked as not 
measuring what they purport to measure” (21, p. 201); that the “criterion problem” 
with regard to what constitutes improvement in psychotherapy remains unsolved 
(174, p. 367); that “after all these years, and after literally hundreds of studies on 
anxiety, there is still no general agreement as to what the commonly used scales are 
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measuring” (2, p. 233); that personality measurement is in a “‘state of chaos” (49, 
p. 50). Against this background, it is more sensible to regret the lack of sophistica- 
tion about methodological problems in the epidemiological research than to indict 
the researchers for having failed to solve them. 

We are in a situation then where different concepts and methods of assessment 
in the epidemiological studies have led to vastly different assessments of rates of 
psychopathology. Moreover, there is no way to choose, on the basis of the evidence, 
a subset of these studies in which more valid procedures have been employed. 

Note, however, that the diagnostic difficulties are not so different from those faced 
by genetically oriented researchers who have so successfully employed twin and 
adoption strategies in the study of schizophrenia. As Lindzey and co-workers 
pointed out, reported concordance rates for monozygotic twins vary from 6 percent 
to 69 percent as a function of methodological factors (103). These problems have 
not, however, prevented the research from demonstrating impressively consistent 
relationships between degree of genetic similarity and concordance for schizophre- 
nia. What is the reason for this? 

Mech! in 1959 published an intriguing defense of formal psychiatric diagnosis. He 
wrote: 


The fundamental argument for the utility of formal diagnosis can be put either causally 
or Statistically, but it amounts to the same kind of thing one would say in defending formal 
diagnosis in organic medicine. One holds that there is a sufficient amount of etiological 
and prognostic homogeneity among patients belonging to a given diagnostic group, so that 
assignment of a patient to this group has probability implications which it is clinically 
unsound to ignore (116, p. 103). 


Although the epidemiological investigators differ in their concepts [e.g. “‘symp- 
tom patterns” of a particular nosological type in the Stirling County study (100) 
rather than diagnoses as in the majority of studies], methods, and beliefs about 
etiology and prognosis, most present their findings in terms at least of broad nosolog- 
ical distinctions. Moreover, there appears to be considerable agreement among them 
on these vividly contrasting sets of symptom complexes despite differences in where 
they draw the boundaries among the different types and between all of the types of 
psychopathology and “normality.” Our reason for believing that such basic agree- 
ment exists is that we found in previous analyses that there were consistent relation- 
ships from study to study between various types of psychopathology and various 
social and cultural factors (33). Such consistencies we take as evidence of the 
meaningfulness, “the truth,” of these broad nosological distinctions as manifested 
in the epidemiological investigations. 

These grounds seem sufficient to warrant extending our analyses on the basis of 
the expanded number of investigations that we now have available to see what more 
we can learn. In doing so we shall give each investigation equal weight in our search 
for firm relationships, excluding studies only when they do not present data on 
relevant types of psychopathology or on relevant types of social and cultural factors 
for some of the questions that we will ask of them. 
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RELATIONS BETWEEN SOCIAL AND CULTURAL FACTORS 
AND PSYCHOPATHOLOGY 


Most researchers concerned with etiology would agree today with Hamburg and his 
colleagues that “By now, it is clear that serious disorders of behavior may arise from 
many influences—biological, psychological, and social” (72, p. 86). They would 
disagree, however, in their hunches about the relative importance of these influences 
(cf. 73, 147). 

The position that attributes greatest importance to social and cultural factors has 
been summarized by Slotkin as follows: 


Man inherits a limited repertory of responses. Most of his responses are learned, and the 
majority of these are learned from others, t.e., culturally acquired. Consequently, it is to 
be expected that the individual’s responses, abnormal as well as normal, are greatly 
influenced by his cultural milieu. And cultures vary tremendously (155, p. 274). 


Two corollaries of this point of view, as stated by the same author, are that 
“cultural disorganization will lead to personality disorganization” (155, p. 271) and 
that “sheer complexity of civilization contributes to personality disorganization” 
(155, p. 271). At the extreme, as Burgess pointed out, this orientation has led to the 
argument that: 


Among primitive societies . . . psychoneuroses and psychoses as we know them are absent 
or occur very seldom. These societies are static. They are in adjustment to life conditions. 
They provide satisfying roles for their members (17, p. 13). 


This picture contrasts with one drawn of our own society: 


There ts a growing realization among thoughtful persons that our culture is sick, mentally 
disordered, and in need of treatment. ... Our so-called social problems ... are to be 
viewed as arising from the frantic efforts of individuals, lacking any sure direction and 
sanctions or guiding conception of life, to find some way of protecting themselves or of 
merely existing on any terms they can manage in a society being remade by technology. 
Having no strong loyalties and no consistent values or realizable ideals to cherish, the 
individual’s conduct is naturally conflicting, confused, neurotic and antisocial . . . (50, pp. 
335, 339-40). 


This is Lawrence K. Frank writing in 1936. It is a vivid example of similar state- 
ments made before and since—by editorial writers, clergymen, politicians, univer- 
sity students, and the man in the street, as well as by behavioral scientists. The broad 
social causation position that it illustrates provides a rationale for the contemporary 
efforts of community psychologists to secure primary prevention of psychopa- 
thology through strategies of social and community interventions (25). 

It seems to us that these statements of the social causation orientation assume, 
at a minimum, that they are not contradicted by the answers to certain questions 
of fact. They assume that types of psychopathology are different in different cultures 
and that, at the extreme, cultures exist that immunize their citizens against the 
development of psychopathology. Similarly, such statements assume that the 
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stresses and strains of modern mass society produce more psychopathology today 
than in the past and, by extension, that rates of psychopathology are higher in urban 
than in rural settings. They assume that in our “sick society” some parts are 
inflamed and hurting or, in terms of the process involved, that social disorganization 
within society is producing higher rates of psychopathology in some groups than 
in others. 

The “facts” most often cited with reference to such assumptions have come 
mainly from studies of psychiatric patients and anecdotal accounts (cf. 11, 132, 168, 
169, 175). Let us, therefore, examine these assumptions in light of the best evidence 
that we have available—the results of epidemiological studies that have attempted 
to investigate the occurrence and distribution of psychopathology independently of 
treatment status in general populations. 


Are There Cultures That Immunize Their Members Against the 
Development of Psychopathology? 


In 1955 M. J. Field, an English psychiatrist and anthropologist, began a remarkable 
study in rural Ghana (47). It included a subproject on chronic schizophrenia in 12 
Ashanti country towns and villages that is of direct relevance to our question. Using 
key informant techniques and direct observation, she diagnosed 41 chronic schizo- 
phrenics out of a total population of 4283 residents of these villages, a rate of 9.6 
per 1000. We have reviewed elsewhere 31 studies from which we could extract 
“true” prevalence rates for schizophrenia (34). Although these rates do not distin- 
guish between chronic and acute types, and should thus be diagnostically more 
inclusive than Field’s, only one study reported a rate of schizophrenia as high as 
hers (15). Two more reported higher rates, but these were for the occurrence of 
symptom patterns of a schizophrenic type rather than for diagnosed cases of schizo- 
phrenia (99, 159). 

Further evidence is provided by five studies that gave estimates of the overall rates 
of psychopathology in contemporary preindustrial cultures. These studies yielded 
rates ranging from 0.8 percent in two relatively isolated aboriginal groups in Taiwan 
(143) to the highest rate reported in any study to date, 69 percent for an American 
Indian group in the northwestern United States (153). In between are rates of 1.7 
percent reported for the Hutterites, a culturally distinct religious sect which will be 
discussed further (43), and 5.4 percent and 6.8 percent reported by two studies of 
aboriginal groups in Australia (90, 91). Only the American Indian study is based 
on procedures for case identification similar to those used in an industrialized 
culture (153). Since Shore and his colleagues who conducted the study modeled their 
procedures on that of the Leightons, their rate of 69 percent can be considered in 
relation to somewhat over 50 percent reported by the Leightons for a rural maritime 
county in Canada (100). Clearly this comparison does not support the “noble 
savage” hypothesis, nor does the Leightons’ own study of the Yoruba villages in 
Nigeria, where they found rates of about 40 percent (99). 

It can be argued, of course, that we are not hearing about cultures of the kind 
that Burgess discussed as being “static” and “in adjustment to life conditions,” 
because they no longer exist. Rather, the reports are of psychopathology shown by 
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natives of cultures under the disruptive impact of contact with industrialized soci- 
eties (cf. 132). In a sense the argument is academic since there is no way that we 
can construct a time machine to send teams of psychiatrists back into precontact 
periods of history (cf. 11). However, there are ethnic enclaves that have been 
remarkably successful in preserving their traditions in the face of contact with 
modern mass society. Fortunately, a study of one of them provides valuable data 
on our question. 

On the basis of their review of the literature, Eaton & Weil pointed out that 
speculations about the absence of psychopathology in some primitive societies were 
based on limited contact, resulting in observations of uncertain validity (43, p. 22). 
To provide more systematic data, they set out to study the Hutterites, a group with 
a strong reputation for excellent mental health based on just such anecdotal ac- 
counts. , 

The Hutterites are an ethnic enclave of about 8500 persons who had lived for over 
75 years in the United States and Canada at the time of the research. They com- 
prised a religiously oriented and self-sufficient communal society with a secure 
agrarian economy. The authors convincingly estimate that less than 5 percent of the 
males had left their communities never to return, and that the rate among females 
was much lower. There was a record of only one divorce and four separations in 
the history of the group. And no Hutterite had ever been allowed to become a public 
charge as long as he remained a member of his community. This extraordinarily 
stable society had been highly effective in providing cradle to grave support for its 
members. Did it also protect them from developing psychopathology? 

Eaton & Weil’s expectation when the study was initiated was “that few cases of 
mental disorder would be found” (43, p. 229). If they had restricted their study to 
treated rates, they would have confirmed their hypothesis since there were no 
Hutterites in mental hospitals or under treatment by psychiatrists in other settings 
at the time their fieldwork was done. 

On the basis of direct interviews with subjects or key informant reports from 
members of the Hutterite population, however, Eaton & Weil arrived at a rate of 
psychosis that ranked the Hutterites as third highest of ten populations for whom 
they computed age-sex adjusted rates—higher, for example, than the urban and far 
from affluent Eastern Health District of Baltimore. The researchers were aware, 
however, that population comparisons are difficult to make across these studies, and 
they suspected that if the other studies had used comparable procedures, the Hutter- 
ite rate would have ranked relatively much lower. Be this as it may, given our 
question as stated and the initial expectations with which Eaton & Weil began their 
research, it is difficult not to accept their conclusion that the “findings do not 
confirm the hypothesis that a simple and relatively uncomplicated way of life 
provides virtual immunity from mental disorders” (43, p. 209). It would seem that 
the burden of proof is on those who would locate a symptom-free Utopia in the real 
world. 

Nevertheless, it should be noted that persistent and severe antisocial behavior was 
indeed close to being absent among Hutterites. Only four Hutterites were diagnosed 
as having personality disorders and only two of these involved antisocial behavior 


SOCIAL AND CULTURAL INFLUENCES ON PSYCHOPATHOLOGY 431 


that posed persistent problems to the community. Even here the problems were 
relatively mild, taking the form of property thefts in the one case and misuse of 
colony funds in the others. Beyond this there were only minor law violations by 
Hutterites, usually on a one-time-only basis. These results are especially remarkable 
in that the key informant techniques of data collection that Eaton & Weil used 
should be especially effective in securing adequate coverage of this kind of psychopa- 
thology. 

Moreover, although the rate of functional psychosis that Eaton & Weil found was 
appreciable, the subtypes—if not a function of some form of idiosyncratic diagnostic 
procedure—showed an unusual distribution. By contrast with the high ratio of 
schizophrenia to manic-depressive psychosis usually reported, the cases of manic- 
depressive psychosis far outnumbered cases of schizophrenia among the Hutterites. 
The Hutterite rate for the latter, in fact, was the lowest we found in the 31 studies 
we reviewed that reported “true” prevalence rates for schizophrenia (34)—which 
brings us to our next question of fact. 


Are Types of Psychopathology Different in Different Cultures? 


This question has at least four parts: Do the types of psychopathology that we find 
in Western societies exist in non-Western cultures, or are Western classifications 
simply inapplicable? If the Western classifications of types of psychopathology do 
apply, are the distributions of these types the same in all cultures, or are there 
culturally predominant types? Are there new types of psychopathology to be found 
in non-Western cultures? If not new types, are there different symptom manifesta- 
tions of types of psychopathology that are otherwise familiar to us? 


THE APPLICABILITY OF WESTERN NOSOLOGIES In their 1971 Annual Review chap- 
ter on Classification of Behavior Disorders, Phillips & Draguns concluded that 
although these things are matters of controversy, “The majority opinion of the field 
holds that diagnostic categories are universal in representation, though different in 
distribution” (137, p. 469). We would agree, though the evidence supporting the first 
part is considerably firmer than the evidence supporting the second part of the 
opinion. 

Wherever psychiatric epidemiologists have attempted to conduct studies in non- 
Western societies marked by their own distinctively different traditions and a rela- 
tive absence of modern technology, one of their main goals has been to see whether 
Western psychiatric classifications are applicable. We know of no instance in the 
“true” prevalence studies where investigators have attempted to use Western classi- 
fications in non-Western societies and felt them to be inapplicable (e.g. 10, 47, 98, 
178). 

If you doubt the applicability of Western classifications of psychopathology to 
these vastly different cultures and have had clinical training, we suggest that you 
look at Field’s descriptions of a sample of the more than 2500 cases that she observed 
at religious shrines in rural Ghana (47). She presents descriptions of the cases in 
considerable detail and also her own diagnoses, based on Western nosology. See if 
you agree with her clinical judgments. 
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ARE THERE DIFFERENCES IN DISTRIBUTION AMONG THE CATEGORIES OF PSYCHOPA- 
THOLOGY? The question of whether the relatively familiar types of psychopa- 
thology are differently distributed in non-Western societies is more difficult to 
answer. This is an area rich in anecdotal evidence that tends to evaporate when more 
extensive studies are conducted. A good example was the belief that depressive 
symptomatology is extremely rare in rural African communities (11). In the Yoruba 
study, depression was found to be the most frequent symptom among African 
women (99), as it was among the women who brought their problems to rural 
shrines in Ghana (47). The earlier reported absence of depression may stem from 
the focus on persons in treatment or persons defined by local norms as deviant. Such 
definitions of disorder can be very misleading about the types of psychopathology 
found in general populations, as Katz’ comparisons of hospital patients and commu- 
nity samples from contrasting ethnic groups in Hawaii suggested (86). 

Although there is far less cultural contrast between the United States and England 
than between, say, the residents of Stirling County and the residents of Yoruba 
villages, a much higher proportion of schizophrenia compared to manic-depressive 
psychosis was found among hospitalized patients in the United States than in the 
United Kingdom. When diagnostic criteria between the two countries were stan- 
dardized, however, this difference largely disappeared (24). It is possible that the 
unusually high ratio of manic-depressive psychosis to schizophrenia found by Eaton 
& Weil among the Hutterites is also similarly a diagnostic artifact. The fact is that 
comparisons of rates for types of psychopathology across different investigators 
using different methodologies is hazardous. The boundaries of the different types of 
psychopathology are simply not drawn with sufficient consistency from study to 
study. 

Nevertheless, we believe it is likely that types of psychopathology are indeed 
differently distributed in contrasting cultural settings. The main reason is a set of 
highly consistent findings of differences in types of psychopathology between groups 
governed by different social norms within communities. These findings will be 
discussed in detail in a later section. 


ARE THERE DIFFERENT SYMPTOM MANIFESTATIONS OF TYPES OF PSYCHOPA- 
THOLOGY THAT ARE OTHERWISE FAMILIAR TO us? It is by now so well recognized 
that symptoms can take on coloration from the culture that we shall simply join the 
chorus by agreeing with this conclusion (cf. 10, 20, 47, 87, 98, 137, 178). Cows and 
witches, for example, occupy a more important place in the content of symp- 
tomatology in rural Ghana (47) than in the content of symptomatology shown by 
residents of Midtown Manhattan (159). Research subjects from lower class back- 
grounds are more likely to express psychological distress in somatic terms than 
research subjects from higher class backgrounds (e.g. 26, 77). So are Puerto Ricans 
by contrast with Blacks, Jews, and Irish (33). And screening instruments composed 
of symptoms such as those in the 22-item measure from the Midtown study are more 
representative of female than of male modes of expressing distress (14, 45, 136). 

Such cultural differences in modes of expressing distress are interesting in their 
own right. They also pose serious measurement difficulties when the research prob- 
lem is to compare groups that differ in social and cultural background—a method- 
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ological problem that has received very little systematic attention. Our own efforts 
to deal with this problem of valid cross-group comparison yielded a strategy so 
complex that it is reminiscent of a Rube Goldberg (39). Perhaps as more attention 
is focused on this issue a simpler solution will develop. 


DOES SOME OF THE EXOTIC SYMPTOMATOLOGY OBSERVED IN NON-WESTERN CUL- 
TURES INDICATE THE PRESENCE OF TYPES OF PSYCHOPATHOLOGY THAT ARE NOT 
INCLUDED IN WESTERN NOMENCLATURES? With this question, we are back again 
in the realm of controversy (e.g. 137). Yap, for example, argued that such exotic 
syndromes as amok and Jatah can be placed in the context of modern clinical 
psychiatry (177). Murphy, by contrast, treated them as distinct disorders (127). 
Which is the more useful view, we think, depends in part on the particular research 
purpose, for there is no way to resolve the issue without increased knowledge of 
etiology. 

Amok and Jatah are vivid enough to have left traces that can be analyzed in their 
own right—as Murphy’s historical detective work shows. In such analyses of trends 
over time, he has every reason to consider them separate types of psychopathology. 
By contrast, as Yap emphasized, common classification systems are needed for 
communication and comparison. It is thus the bias of the epidemiologist under the 
usual circumstances of his trade to want to employ standard categories for compara- 
tive purposes. The broader issue, as Yap well states it, is “how to integrate new 
knowledge into old, or, conversely, how to allow the application of accumulated 
knowledge to fresh clinical problems” (177, p. 34). 


Is There More Psychopathology Today Than in the Past? 


Some types of symptomatology leave traces over time in that they are damaging 
enough or exotic enough so that they will be recorded or recalled, as in the case of 
amok and Jatah (127). Or they are associated with a physical condition that will 
invariably bring afflicted persons into treatment or will show up on autopsy, as in 
the case with some types of ulcer (165). There is considerable evidence that rates 
of such phenomena vary considerably over time (127, 165). What of change over 
time in rates of the broad types of psychopathology described in Western classifica- 
tions? 

In what must surely be a classic study, Goldhamer & Marshall capitalized on an 
unusually fine set of historical records in Massachusetts to investigate hospital first 
admissions between 1840 and 1940 (61). From statistical evidence it is clear that 
hospital admissions for psychiatric disorders up until recent innovations in services 
have indeed tended to increase markedly over time (e.g. 107), undoubtedly in- 
fluenced by increases in facilities, improved diagnoses, changes in public attitudes, 
and changes in the age composition of the population. If, therefore, Goldhamer & 
Marshall had reported just one more study finding an increase in treated rates, we 
would be entirely justified in dismissing their findings as being irrelevant to our 
question. What makes their actual results important, however, is that in circum- 
stances biased against such an outcome, they found zo increase in rates of the 
functional psychoses of the early and middle years. 
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Goldhamer & Marshall speculated that perhaps by 1840 the processes of industri- 
alization were far enough along in Massachusetts so that the 100-year period did 
not extend back far enough in time to permit changes to be detected. As Grob has 
pointed out, psychiatrists in the middle of the 19th century tended at the time to 
see insanity as part of the price being paid for increasing urbanization (66). More- 
over, the Goldhamer & Marshall findings tell us nothing about whether rates for 
the neuroses and for the personality disorders have increased over time. 

Once again we lack the time machine that would send psychiatric epidemiologists 
back into history to conduct the required research. At the very least, however, 
Goldhamer & Marshall’s findings should give pause to facile speculation about 
increases in rates of the functional psychoses with the stresses and strains of modern 
mass society as we know it today—as opposed to what appears to be the all too 
recent yesterday of our great grandfathers! 


Are Rates of Psychopathology Higher in Urban Than in Rural Areas? 


As we implied earlier, there is a widely held belief that the stresses and strains of 
modern technological societies are concentrated in their urban centers. A corollary 
of this belief is that high rates of psychopathology—along with slums, crime, and 
riots—are similarly concentrated in such centers. The empirical basis for such 
opinion, however, has been slim. While it is true that investigators have typically 
found much higher rates of first admissions to mental institutions from urban than 
from rural areas (cf. 146), this information tells us little about the true rural-urban 
distribution even of hospitalizable psychopathology. On the contrary, it has long 
been known that rural and urban rates of treated cases differ in part as a function 
of the lesser availability of treatment facilities and the greater resistance to their use 
in rural areas (80, 125). 

Given the contrasts in their concepts and methods, comparisons of the rates 
obtained by different investigators is frustrating and uninformative so far as substan- 
tive questions are concerned. However, eight investigators have reported data from 
both rural and urban segments of the populations they studied, with two reporting 
data for two segments, each divided into rural and urban portions. Thus we can look 
at 10 pairs of rural and urban rates from studies within which uniform procedures 
were used. 

As Table 3 shows, in one comparison the total rate for all psychopathology 
combined is higher in the rural setting; there is one tie; and in the remaining eight 
comparisons, the urban rate is higher than the rural rate. We must point out, 
however, that although quite consistent, the differences are not large; the median 
difference for total rates is only 1.1 percent and the range is from an excess of 0.9 
percent in the rural area to an excess of 13.9 percent in the urban area. 

The remaining results in Table 3 provide some indication as to why most of the 
differences in total rates are small. While some types of psychopathology were found 
to be more prevalent in the urban setting, others were found more frequently in the 
rural setting. That is, total rates for all psychoses combined tended to be higher in 
the rural than in the urban area in these comparisons, and this appears to be so as 
well for the manic-depressive subtype though not for schizophrenia which may be 
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Table 3 Comparison of rural and urban rates of psychopathology reported 
for studies that included both rural and urban areas 


Type of psychopathology Number of studies reporting higher 
rate in: 

Rural area Urban area 
AU? 1 8 
Psychosis 5 2 
Schizophrenia 2 3 
Manic-depressive psychosis 3 1 
Neurosis 1 5 
Personality disorder 1 5 


din addition, one study reported the same rate for rural and urban areas. 


equally frequent in rural and urban settings. By contrast, the rates of neurosis and 
personality disorder are higher in the urban than in the rural setting in all but one 
of the six pairs of comparisons for which we could extract data on each of these types 
of psychopathology. 

These results are obtained, of course, on the basis of only ten pairs of rural versus 
urban comparisons and can hardly be considered representative in any systematic 
sense of the term. Moreover, the absolute rural-urban difference in rates—especially 
the overall rate-—tend to be small (35). We are inclined, nevertheless, to assign more 
importance to them than others might (e.g. 158). Our reason is that the studies have 
publication dates from 1942 to 1972; take place in the United States, Europe, Asia, 
and Africa; involve cities as different as Tokyo, Reykjavik, and Abeokuta (in Ni- 
geria); and report overall rates of psychopathology ranging from 0.8 percent to 45.0 
percent. The consistency in direction of many of the urban-rural differences reported 
in these studies—despite the diversity of time, place, and method of assessing 
psychopathology—suggests to us that the results be taken seriously. On the basis 
of this evidence—and it is the best that we have available—the most reasonable 
hypothesis appears to be that total rates of psychopathology are somewhat higher 
in urban than in rural areas, due at least in part to an excess of neurosis and 
personality disorder in the urban areas. Whether these differences are a function of 
harsher stress on residents of urban settings, however, is quite another matter. 

However harsh and threatening cities may sometimes be, they also provide con- 
centrations of industry and commerce, wealth and power, art and entertainment 
that make them magnets for rural people. Migrants seeking greater opportunity, 
challenge, or perhaps anonymity than rural environments provide are drawn to 
cities in large numbers. The possibility cannot be overlooked, therefore, that they 
bring with them the types of psychopathology that appear to be more concentrated 
in urban areas rather than developing them in reaction to the stressful circumstances 
of the urban environment. 

Straightforward evidence to resolve this issue is not available. That is, no one has 
yet surveyed members of a rural population for evidence of psychopathology, or 
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perhaps a proneness to develop such disorder, at one point in time and then followed 
this population to determine which members migrated to urban areas and what the 
migrants’ impact was on the rates of psychopathology in the areas to which they 
migrated. Our analysis of the more indirect evidence bearing on this problem (5, 75, 
126, 130) has led us to conclude that the issue is unresolved (35). 


Are Rates of Psychopathology Within Societies Higher in Some Groups 
Than in Others? 


Not all of the “true” prevalence studies provide data broken down by the major 
types of psychopathology in which we are interested; still more present no data on 
either total rates or on subtypes for all of the social categories that we would like 
to examine; and for some factors such as age, the subclassifications across studies 
are far from standard, greatly complicating the analysis (33). Thus the analyses that 
we shall summarize below are restricted to two sets of factors for which consistent 
trends could be established on the basis of an appreciable number of studies. The 
two sets of factors are important: they are sex and social class. 


SEX DIFFERENCES In 1850, Jarvis published an analysis of reports from about 250 
American and European hospitals that suggested to him that men were more prone 
to insanity than women (79). Grob, summarizing Jarvis’ long-standing interest in 
sex differences in this investigation and other writings, noted that Jarvis believed 
that the explanation lay in the dissimilar roles of men and women in society with 
men “more exposed to sensual temptation and the drive for materialistic rewards; 
women, having a more sympathetic and sensible personality, were more exposed to 
emotional factors” (66, p. 49). 

In a recent review, Gove & Tudor found much in the writing of contemporary 
social scientists to suggest quite a different picture for our time from Jarvis’ picture 
of his. Gove & Tudor argued that now “... women find their position in society to 
be more frustrating and less rewarding than do men and that this may be a relatively 
recent development” (65, p. 816). Accordingly, they “postulate that, because of the 
difficulties associated with the feminine role in modern Western societies, more 
women than men become mentally ill” (65, p. 816). 

To investigate this question, Gove & Tudor focused on research conducted since 
World War II, in the belief that this time was a high point of change in women’s 
role with significant portions of married women entering the work force for the first 
time. They also restricted their coverage to research conducted in industrialized 
countries of North America and Europe. The most important data that they brought 
to bear on the issue came from some of the same “true” prevalence studies with 
which we have been concerned, and from community studies using various symp- 
tom measures, such as the 22-item Midtown scale, but not indicating what is to be 
considered a “case.” The results are remarkably consistent with their hypothesis. 
However, they made a decision about the definition of psychopathology that badly 
compromised their treatment of the question. 

They defined “mental illness” as consisting of neurotic disorders and functional 
psychoses if the studies provided data in the form of diagnoses, or as “personal 
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discomfort” as indicated by measures of subjective distress and anxiety if the studies 
provided data in the form of symptom scores only (65, p. 812). In doing so, they 
combined disparate types of psychopathology and they excluded a major diagnostic 
category, personality disorder. The unfortunate consequences of these decisions will 
soon become apparent. 

In our previous reviews of the “true” prevalence studies, we found that overall 
rates of psychopathology did not show consistent differences according to sex, nor 
did rates of psychosis in general (33) or schizophrenia in particular (34). In sharp 
contrast, we found in our 1969 review that consistently higher rates of neurosis were 
reported for women while consistently higher rates of personality disorder were 
reported for men (33). The studies that we have added since the earlier review 
change this broad picture very little with respect to rates of psychosis, rates of 
neurosis, and rates of personality disorder according to sex. However, whereas in 
our 1969 review we found 12 studies showing higher overall rates for males and 18 
for females, the tally now is 16 in which men and 27 in which women show the 
higher total rates of psychopathology. 

Gove & Tudor reported that when they divided the North American and Eu- 
ropean studies that we included in our 1969 review into pre-World War II and 
post-World War II groups, overall rates were consistently higher for men in the 
earlier studies and for women in the later studies. Given the biased definition of 
“mental illness” that they used in their own analysis of the literature, it would seem 
that this is the strongest support that they have for their hypothesis, other things 
equal. Other things, however, are not equal. 

Women’s roles in North America and Europe should be less traditional and 
hence, by extension of Gove & Tudor’s argument, more stressful in urban than in 
rural areas, especially in urban areas following World War II. Thus, neurosis and 
functional psychosis, the two broad types of psychopathology that Gove & Tudor 
combined with measures of anxiety and distress to indicate “mental illness,” should 
each be more likely to show higher rates for females in the later studies than in the 
earlier and in the urban than in the rural settings—assuming as Gove & Tudor do 
that environmentally induced stress is the major factor responsible for these types 
of psychopathology. 

Unfortunately, few of the studies published prior to and early in World War II 
(there is a gap in publications between 1943 and 1950) provide data on psychosis, 
neurosis, and personality disorder according to sex. As Table 4 shows, no clear 
trends emerge for these types of psychopathology in these earlier studies. The later 
studies, however, show higher rates of neurosis for women than for men in both 
rural and urban settings, while for psychosis, the results are just the opposite of what 
Gove & Tudor’s position implies, as women tend to show higher rates in rural 
settings while men tend to show higher rates in urban settings. The latter finding 
is more congruent with Jarvis’ beliefs about sex roles in the “modern industrial 
society” of 19th century Massachusetts than Gove & Tudor’s ideas about sex roles 
in the “modern industrial society” of today. 

The question remains as to why in Table 4 overall rates of psychopathology 
should be higher for males in 7 out of the 9 earlier studies while overall rates for 
females are higher in 18 out of the 19 later studies. This is further documentation 
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of the reversal over time so perceptively noted and incorrectly interpreted by Gove 
& Tudor on the basis of our 1969 review of a smaller number of studies. 

The key to the answer lies, we believe, in the fact that the overall rates of 
psychopathology in the later studies show the same relation to sex in Table 4 as does 
neurosis—-consistently higher rates for females in rural as well as in urban settings. 
As in the case of neurosis, this is hardly what we would expect on the basis of Gove 
& Tudor’s role change argument and the reasonable assumption that women’s roles 
are more traditional in rural settings. However, it is what we would expect in terms 
of another set of considerations that have to do with methodological differences 
between the earlier studies and the later studies: first, the use of expanded nomencla- 
tures following World War II; second, the use of symptom items such as the 22-item 
scale from the Midtown study and a battery of similar items developed by the 
Stirling County researchers in many of the later studies. As we showed earlier, the 
expanded nomenclature is largely responsible for the increase in overall rates shown 
by the studies published in 1950 or later. Moreover, as was also noted, the 22-item 
scale and similar measures tend to sample neurotic types of symptomatology rather 
than symptoms of psychosis or personality disorder. These methodological consid- 
erations, rather than the role hypotheses of either Jarvis or of Gove & Tudor, seem 
to provide the most parsimonious explanation of the reversal over time in overall 
rates according to sex shown in Table 4. 

Nevertheless, there is a major substantive issue posed by some of the results in 
Table 4 and it lies in the contrasting excess of women among neurotics and of men 
among those diagnosed as having a personality disorder. The contrasts are genera! 
for rural and urban areas and tend to hold as well in the studies, excluded from Table 
4 above, done outside of North America and Europe. Either social factors that define 
sex differences are more constant or biological bases of such differences are more 
potent than social causation theorists have supposed. 

Annual Review authors have been pointing out scattered but interesting reports 
of sex differences and calling for more research for years—and at an accelerating 
pace in more recent volumes (82, 104, 148, 154). As Bardwick’s recent book (6) and 
Maccoby’s review of it (105) suggest, the issue of the relative contribution of biologi- 
cal factors and social factors to these differences is very much alive. We would 
suggest that the issue comes to an interesting focus in the sex differences in neurosis 
and personality disorder summarized above. 


SOCIAL CLASS Qn the basis of previous analyses of the “true” prevalence studies, 
we found that the most consistent result reported was an inverse relationship be- 
tween overall rates of psychopathology and social class (33). As Table 5 shows, this 
relationship holds when we add additional findings from among the studies that we 
have located since our 1969 review. Of the 33 communities for which the relevant 
data were reported, 28 yielded the highest rate in the lowest class. Moreover, 
although an earlier, less complete compilation of findings led us to speculate that 
this inverse relationship between social class and overall rates of psychopathology 
did not hold in rural areas (35), the expanded coverage of studies, reported in Table 
5, no longer suggests as clear a difference in this respect between rural and urban 
areas. 
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Table 5 Comparison of rates of different types of psychopathology in lowest and other 
than lowest social classes within rural and urban study sites 


Study site 
Rural Urban Mixed? 
Number of studies in which rate is highest for: 

Type of Lowest Other than Lowest Other than Lowest Other than 

psychopathology class lowest class class lowest class class lowest class 
All 9 4 17 1 2 0 
Psychosis 1 5 3 3 2 1 
Neurosis 3 3 4 3 0 2 
Personality disorder 4 2 5 1 2 0 


“Data reported for sites that are not predominantly rural or urban with no separation of 
rural and urban rates. 


We have been able to add few data on the major subtypes of psychopathology and 
what we have added produced little change in the trends that we reported in our 
earlier review. Personality disorder shows a consistent inverse relationship with 
class; neurosis and psychosis do not. 

It should be noted, however, that the broad category of psychosis masks some 
strong differences between two subtypes, manic-depressive psychosis and schizo- 
phrenia. Only seven studies reported data on schizophrenia and manic-depressive 
psychosis according to class (33). Of these seven none reported the highest rate of 
manic-depressive psychosis in the lowest class while, by contrast, five of the seven 
reported the highest rate of schizophrenia in the lowest class. In addition, the 
Midtown study researchers, who did not report data in the form of specific diag- 
noses, found an inverse relationship between class and symptom patterns judged to 
be of a schizophrenic type (97). The only exceptions to this inverse relationship are 
studies done in two Japanese communities, and they may be exceptions in Japan 
also. That is, Kato, a Japanese epidemiologist, has informed us that they are at 
variance with unpublished findings from his nationwide study of untreated as well 
as treated disorder (85). 

Furthermore, studies of schizophrenia based only on treated rates have usually 
shown the same inverse relationship with class (41, 46, 70, 77, 123, 170). Among 
these studies of treated cases—and there are about a dozen reported or reviewed in 
the references cited above—only one did not show the highest treated rates of 
schizophrenia in the lowest social stratum (23). 

Altogether then there is considerable evidence of a high concentration of psy- 
chopathology in the lower class compared to all other classes. Why should this be 
so? 

In 1939, Faris & Dunham published their classic ecological study of psychiatric 
disorder in Chicago (46). Their well-known finding was that hospital first admission 
rates for most psychoses, including schizophrenia, were highest in the central slum 
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sections of the city and decreased regularly as one moved out toward the well-to-do 
suburban areas. Consider the explanation that Faris & Dunham favored for their 
results: 


In these most disorganized sections of the city and, for that matter, of our whole civiliza- 
tion, many persons are unable to achieve a satisfactorily conventional organization of their 
world. The result may be a lack of any organization at all, resulting in a confused, 
frustrated, and chaotic personality, or it may be a complex but unconventional original 
organization. In either case there is a serious divergence from the conventional organiza- 
tion which makes communication and understanding impossible and, for that matter, 
makes any form of cooperation difficult. It is just this type of unintelligible behavior which 
becomes recognized as mental disorder (46, p. 159). 


This view that social location is a cause of psychopathology is reflected in the 
theoretical explanations supplied in most of the best known epidemiological studies 
conducted by U.S. investigators. The study, for example, by Hollingshead & Redlich 
of treated rates of psychopathology according to social class in New Haven was 
motivated by, and the findings explained in terms of, a view that “psychiatrists work 
with phenomena that are essentially social in origin” (77, p. 11). The central theoret- 
ical hypothesis of the Stirling County study (100) and the Yoruba study (99) was 
that “social disintegration” is causally related to psychopathology, and the results 
of the studies were interpreted as being consistent with this hypothesis. Similarly, 
the Midtown study investigators took as their "most fundamental postulate” the 
proposition that “sociocultural conditions ... have measurable consequences re- 
flected in the mental health differences to be observed within a population” (159, 
pp. 12-13), and they interpreted their main findings in terms of the pathogenic 
consequences of a “poverty complex” and of processes of “stigmatize-rejection” and 
“role discontinuity” (159, pp. 366-71), further elaborated in terms of a theory of 
environmentally induced “stress” and “strain” (97). 

There have also been at least two ambitious extensions of these approaches along 
more social psychological lines, each strongly influenced by the work of Merton 
(120). One of the approaches was developed in an investigation of drinking and 
related problem behavior in a town and its surrounding rural area in southwestern 
Colorado (83); the other focused on explaining symptoms and diagnoses in, respec- 
tively, a community sample and a psychiatric patient sample of Blacks in Philadel- 
phia (133). Both theoretical formulations postulate that a major source of stress is 
created by environmentally induced dysjunctions between goals and the means to 
achieve them and in related discrepancies between aspirations and achievements. 

All of these theoretical positions are examples of social causation orientations. In 
each, psychopathology is viewed as in large part a consequence of environmentally 
induced stress on the individual. To varying degrees these approaches are quite 
compatible with Frank’s “sick society” analogy, with the addition that they locate 
the sources of the social organism’s greatest inflammation and pain in its slums and 
low status groups. 

Foreshadowing the findings that we have been discussing, Jarvis reported in 1855 
that “the pauper class furnishes, in -atio of its numbers, sixty-four times as many 
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cases of insanity as the independent class” (80, pp. 52-53). His explanation for this 
finding differed sharply from the environmentalist views described above, however. 
He wrote: 


Men of unbalanced mind and uncertain judgment do not see the true nature and relation 
of things, and they manifest this in mismanagement of their common affairs. They do not 
adapt the means which they possess or use to the ends which they desire to produce. 
Hence they are unsuccessful in life; their plans of obtaining subsistence for themselves or 
their families, or of accumulating property, often fail; and they are consequently poor, 
and often paupers. ... the cause of ... their mental derangement lies behind, and is 
anterior to, their outward poverty (80, pp. 55-56). 


In this interpretation Jarvis provides an early example of what has come to be known 
as the social selection explanation of relationships between social position and 
psychopathology (40). This type of explanation was encountered previously with 
regard to the rural-urban differences, in the suggestion that rural migrants to cities 
may bring psychopathology with them rather than developing it in response to the 
stress of urban living. It stems from a theoretical orientation more characteristic of 
epidemiological researchers in Europe today than in the United States (131, 163). 

By contrast with the social stress theorists, social selection theorists see social 
position not as a cause but as a consequence of psychopathology. And just as a social 
stress explanation is consistent with an environmentalist orientation to etiology, so 
is a social selection explanation consistent with a genetic view. 

In his essay introducing present-day readers to Jarvis’ 1855 report, Grob makes 
an interesting comment on Jarvis’ interpretation of the relationship he found be- 
tween poverty and insanity that helps to put this issue in prespective: 


Like others coming from the same background, Jarvis saw the genteel New England 
society that he loved and valued being eroded. ... In a very real sense, therefore, some 
of the points that he attempted to make... were a reflection of his own social ideology. 
... What Jarvis did was to take a series of statistics and read them in the light of his own 
moral assumptions ... (66, p. 56). 


Would Grob be less critical, however, of the social stress interpretations of the 
results of the more recent epidemiological investigations that have reported findings 
so similar to Jarvis’? We doubt it. 

As early critics of Faris & Dunham have argued, downward social drift of 
previously ill persons could explain plausibly the concentration of cases in the slums 
of Chicago (e.g. 121). Nor does a finding in the New Haven study that most lower 
class cases of psychopathology had originated in that class rather than having 
drifted down to it resolve the problem. As Gruenberg pointed out, in a society such 
as our own where upward social mobility is the norm, it is quite possible that the 
healthy members of the lower class have been selected upward, leaving behind a 
“residue” of il] (67). Both downward “drift” and this notion of “residues” are major 
varieties of what was designated above in broader terms as the social selection 
hypothesis. E 
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Some findings from the Midtown Study illustrate the difficulties involved in 
choosing between social stress and social selection explanations of the relation 
between rate of psychopathology and social position. In an advance over previous 
epidemiological studies, the Midtown researchers investigated parental class posi- 
tion, a factor clearly antecedent to the current psychiatric condition of their adult 
respondents. Finding a significant inverse relation between their respondents’ symp- 
tomatology and the socioeconomic status of their respondents’ parents, the investi- 
gators suggested that environmental deprivation in childhood is a causal factor in 
psychopathology. However, they also found that the relation between parental 
socioeconomic status and impairing symptoms was weaker than the relation of 
respondents’ own socioeconomic status to impairing symptoms. Moreover, respon- 
dents rated impaired were most likely to be found among those who were down- 
wardly mobile relative to their parents, and least likely to be found among those who 
were upwardly mobile. Accordingly, the Midtown researchers concluded that per- 
haps both social stress, in the form of childhood deprivation, and social selection, 
in the form of intergeneration mobility, contribute to the strong inverse relation 
between psychopathology and the respondent’s own socioeconomic status (159, pp. 
212-13, 228-29). However, as implied by Michael, one of the Midtown researchers, 
genetic predisposition could, with equal plausibility, be substituted for childhood 
deprivation in this interpretation (159, pp. 329-30). 

Like Jarvis’ research, the more recent epidemiological investigations have been 
cross-sectional, each conducted at one time and in one place and without experimen- 
tal controls. Like his research, they were not designed to test competing social stress 
versus social selection hypotheses. Thus it appears that it is the Zeitgeist that 
changed from his time to theirs far more than the empirical basis for choosing 
between opposing social stress and social selection explanations of relations between 
social location and psychopathology. 


TOWARD A RESOLUTION OF THE SOCIAL STRESS-SOCIAL 
SELECTION ISSUE 


A number of recent studies with data on social mobility have shown that social 
selection processes play a part in relations between social class and psychopa- 
thology, especially schizophrenia and antisocial personality (e.g. 12, 42, 60, 144, 
170). How large a part they play relative to social stress, however, has remained at 
issue (33) and has, in fact, become a matter of sharp controversy in efforts to explain 
the inverse relationship between social class and schizophrenia (92, 93, 114). 

The social stress-social selection issue is a key one. Its resolution would move 
research on relations between social and cultural factors and psychopathology off 
the dead center to which the cumulative results of the “true” prevalence studies have 
brought us—a center marked out by vivid, consistent, and highly challenging rela- 
tionships that lend themselves all too promiscuously to equally vivid but opposing 
theoretical explanations. Let us, therefore, summarize our own approach to this 
venerable issue by way of an illustration of where we think this field will be moving 
in the future. 
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A Quasi-Experimental Strategy for Testing of the Social Stress-Social 
Selection Issue 


More impressive research has been done on the etiology of schizophrenia thus far 
than on any of the other types of psychopathology with no known organic basis. 
The strongest of this research has been made possible by the identification of natural 
experiments. The prime examples are the control of heredity provided by the phe- 
nomenon of monozygotic twins (e.g. 62) and the strategies centering on adoption 
(e.g. 147). In such strategies, however, the powerful contrasts so far have been on 
genetic rather than on the social environmental side of the nature-nurture issue. Are 
there other natural contrasts that would make it possible to develop quasi-experi- 
mental strategies for investigating the etiological implications of the relation be- 
tween social class and psychopathology? 

We have argued that at least one such set of conditions does exist and that it 
provides, potentially, a key to a crucial test of the social stress—social selection issue 
for types of psychopathology such as schizophrenia and antisocial personality that 
are inversely related to social class (28, 33). The set of conditions is provided by the 
processes whereby ethnic groups are assimilated in relatively open-class societies. 
Let us try to summarize the main ideas involved in this quasi-experimental strategy. 


SOME SOCIAL HISTORY Jarvis reported in 1855 not only that paupers had dispro- 
portionately high rates of insanity but that disproportionately high numbers of the 
poor and insane were new Irish immigrants. While he thought that both environ- 
mental factors and heredity were responsible, he believed that the latter was more 
important, speculating that: 


The Irish laborers have less sensibility and fewer wants to be gratified than the Americans, 
and yet they more commonly fail to supply them. They have also a greater irritability; 
they are more readily disturbed when they find themselves at variance with the circum- 
stances about them, and less easily reconciled to difficulties they cannot overcome. 


Unquestionably much of their insanity is due to their intemperance, to which the Irish 
seem to be peculiarly prone ... (80, p. 62). 


The reason for his findings, he implied, was selective migration to the United States 
by those Irish whose inherited weaknesses had predisposed them to both poverty 
and insanity (80, pp. 62-63). Consider Jarvis’ views on this matter in the context 
of the history of ethnic groups in New York City. 

The urban history of New York has been marked by successive waves of new 
immigrant groups: the Irish and Germans in the 1840s, the Jews and Italians 
starting in the 1880s, the Blacks after World War I, and the Puerto Ricans after 
World War II (56). With the possible exception of non-Jewish Germans, the initial 
conditions of these new groups in the city, like the Irish of Jarvis’ time in Massachu- 
setts, were poverty, slums, and working-class jobs. The Jews, the Irish, and, to a 
lesser extent, the Italians have moved up over succeeding generations into relatively 
affluent and largely middie class circumstances in New York. In this process of 


SOCIAL AND CULTURAL INFLUENCES ON PSYCHOPATHOLOGY 445 


assimilation, these three ethnic groups have achieved a substantial share in the 
wealth and power of the city, as have, of course, the Irish in Massachusetts. 

It has often been thought that one of the tragedies of the human condition is that 
man does not live long enough to become wise. This is certainly true on this 
particular count for Jarvis. If he were alive today, he would see that the Irish as a 
group have indeed assimilated to middle class circumstances, sharing in the wealth 
and power of the state, and he would be forced to abandon his hypothesis about the 
constitutional inferiority of this immigrant group. In the light of the history of ethnic 
assimilation, it seems an equally poor explanation for the low status of Blacks and 
Puerto Ricans, the newest groups of immigrants to New York City, who are now 
concentrated geographically in its slums and occupationally in its low-paying un- 
skilled and semiskilled jobs. 


THE ALTERNATIVE PREDICTIONS Members of new immigrant groups in our urban 
centers have faced great social pressures in the form, for example, of prejudice and 
discrimination. This was true for the Irish immigrants of Jarvis’ time (e.g. 66); it 
is true for Black and Puerto Rican New Yorkers in ours (e.g. 56). It is hardly 
surprising, therefore, that from a social stress orientation, we would predict that 
Blacks and Puerto Ricans would have higher rates of psychopathology than their 
class counterparts among more advantaged ethnic groups such as WASPS, Jews, 
and Irish in New York City. 

Far less apparent, however, is that from a social selection orientation, we would 
predict just the opposite. For if social pressure is greater on disadvantaged ethnic 
groups such as Blacks and Puerto Ricans, we would expect more of their healthier 
members to be kept in lower class positions, thereby diluting the lower class psy- 
chopathology. By contrast, with less pressure to block them, the tendency of healthy 
members of more advantaged ethnic groups to rise or be able to maintain the higher 
class status to which the groups had been assimilated would leave behind a relatively 
undiluted residue of disabled persons. Thus social selection processes should func- 
tion to give a higher rate of psychopathology among WASPs, Jews, and Irish than 
among Blacks and Puerto Ricans, social class held constant. 


State of the Evidence 


If the theoretical analysis briefly summarized above is correct, we may have some- 
thing quite rare—a major substantive issue that could turn on what deceptively 
appear to be simple questions of fact: the question of whether, for example, rates 
of psychopathology are higher or lower among Blacks and Puerto Ricans in New 
York City relative to the rates for their class counterparts in more advantaged ethnic 
groups. We could substitute other sets of advantaged and disadvantaged ethnic 
groups in other urban centers for Blacks and Puerto Ricans in New York City in 
the question; for example, Indians and Pakistanis versus other Londoners or south- 
ern migrants versus northern residents of Milan. Such replications would provide 
a cumulatively powerful test. 
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Some limited but nevertheless informative comparisons may soon become avail- 
able from Robins’ extension of her remarkable program of life history research on 
antisocial personality (144) to a follow-up study of lower class Black schoolboys 
(145), Langner’s following after 5 years of a children’s sample that included Black 
and Puerto Rican teenagers (96), and from further analysis of Dunham’s data on 
schizophrenia in a Detroit community that contains lower class Blacks and Whites 
(41). By and large, however, the facts to answer this question are not available from 
previous research. The “true” prevalence studies do not provide adequate data on 
different types of psychopathology for contrasting ethnic groups with social class 
controlled (33). Nor are measures currently available for which there is persuasive 
evidence of validity, particularly when comparisons are to be made across ethnic 
groups. 

Thus the two parts of the legacy from the “true” prevalence studies have become 
inseparable: It is impossible to resolve the substantive issues that they pose without 
coming to terms as well with what we described as the Pandora’s box of methodolog- 
ical problems involved in the conceptualization and measurement of psychopa- 
thology in general populations. Since measures used in the epidemiological studies 
have not been adequately tested for validity, the problem has become one of how 
to test existing measures and/or develop and test new measures. Our views on how 
this problem can best be tackled, as well as some of the results of our ongoing efforts 
to solve it, have been presented elsewhere (29-32, 36-39). 

We believe that there are four key sets of problems to be solved in conceptualizing 
and measuring psychopathology in contrasting class and ethnic groups from the 
general population: 

l. Cultural differences in modes of expressing distress. 
2. Relations between physical illness and some types of psychiatric symptomatology 

— especially those thought of as psychophysiological. 

3. Relations between symptoms of disturbance of cognition, affect and volition on 
the one hand and ability and disability in role functioning on the other. 

4. The extent to which symptomatology is situationally specific and the extent to 
which it is relatively independent .of changes in external circumstances. 

Barring breakthroughs in biochemical research, we believe that how such matters 

are dealt with will determine in large part the adequacy of measures of psychopa- 

thology in general populations. 


CONCLUSIONS 


A body of epidemological data on the “true” prevalence of psychopathology and 
its distribution in community populations around the world has accumulated since 
the early 20th century. These data suggest that some currently popular speculations 
about relations between social and cultural factors and psychopathology do not 
represent the best hypotheses to guide future research. The results of these studies 
give pause, for example, to speculations about an increase in psychopathology as a 
function of the stresses and strains of modern society, and about an excess of 
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psychopathology among women as a function of their changing role in modern 
society. 

While these studies taken cumulatively have, we believe, generated less parochial 
hypotheses than any single study or any set of studies done in one area or period, 
they do not settle the basic issues about the role of social and cultural factors in 
psychopathology. Nor do we think that future research will settle these issues until 
it moves from cross-sectional surveys to quasi-experimental studies that exploit 
contrasts provided by ongoing social processes in order to test specific alternative 
hypotheses. 

This field has reached the stage where a fascinating body of descriptive data has 
posed a set of major substantive issues. It is moving toward the stage in which 
increasing sophistication will be brought to bear on the methodological problems 
that stand in the way of resolution of these issues. We would suggest that it is 
precisely at this point in its development that a field becomes an exciting one in 
which to work. 
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INTRODUCTION 


In attempting to organize the mass of material we felt deserved coverage, we found 
it necessary to impose a classificatory structure on it. The structure we imposed is 
in terms of the nature of the dependent variables and the number of populations of 
measurements (e.g. treatments) being explored. There is a considerable degree of 
arbitrariness in our classification scheme, and the classes turn out not to be fully 
mutually exclusive. We feel, however, the reader will find the system useful and 
convenient. We have attempted to write this chapter at a level which will be 
accessible to the sophisticated researcher in psychology and to the relatively ad- 
vanced graduate student. Some papers cited, however, require a considerable degree 
of mathematical sophistication on the part of the reader for full understanding. We 
wish to encourage readers to realize that it is not necessary to be able to follow 
mathematical proofs in detail in order to apply the results. 

Our classification scheme’s first order concerns whether single or multiple depen- 
dent variables are under study; that is, the traditional dichotomy between multivari- 
ate and univariate analyses. The second order of classification considers whether a 
single or multiple populations (treatments, experimental conditions, etc) are in- 
volved. With single populations, the questions considered are ordinarily the testing 
of some hypothesis or fitting of some structural model; multiple populations ordi- 
narily involve testing hypotheses about population differences. 

So far we have a fourfold order of classification. The remaining order of classifica- 
tion is in terms of the nature of the measurements involved and has three levels: 
(2) continuous interval scale data; (b) ordinal data, and (c) categorical data. These 
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levels appear to be somewhat idealized, for in practice they almost reduce to the 
following categories: 

(a) Continuous interval scale—ordinary parametric analyses with normality as- 
sumptions. 

(b) Ordinal data—nonparametric analyses. 

(c) Categorical data—dichotomous data or chi-square analyses. Some cells of the 
resulting twelvefold system were found to contain so little of interest that they are 
not mentioned at all. 


SINGLE DEPENDENT VARIABLE, SINGLE POPULATION 


Continuous Interval Data 


This section could contain discussion of classical and Bayesian estimation problems, 
classical (fixed predictor) regression and curve fitting problems, and discussion of 
some goodness-of-fit tests, all involving a continuous random variable. In reviewing 
the literature, we have found it convenient to ignore estimation problems, as we find 
the recent literature on estimation to be too specialized for a general psychological 
audience. The same statement applies to goodness-of-fit tests in this context. Non- 
parametric or distribution-free goodness-of-fit tests are discussed elsewhere. 

Regression analysis will be dealt with fairly extensively in both the fixed and 
random predictor forms. Regression on predictors which are random variates 
should be dealt with in another section, but for economy and ease of treatment, we 
shall not make the distinction here. 

We might remark that in this context, continuous generally can be interpreted to 
mean normal random variates. 

This section is broken down into a number of subtopics. At the outset we wish 
to bring attention to the book by Draper & Smith (132) that the user of regression 
analysis will find helpful. In addition, the book by Goldberger (182), written for an 
audience of economists, provides fresh and novel insights into many aspects of 
regression analysis ordinarily ignored by psychologists. 


REGRESSION MODELS The question of whether to use multiple regression weights 
or some simpler alternative has been discussed in three papers. Wang & Stanley 
(485) provide a lengthy review of theoretical and empirical work on differential 
weighting problems generally. Claudy (89) explored different weighting procedures 
empirically, and Schmidt (430) by Monte Carlo techniques. Both conclude that for 
small samples relative to the number of predictor variables, equal raw score weights 
should be preferred. 

A series of thoughtful papers by Werts & Linn (491, 493) and Linn, Werts & 
Tucker (321) merit considerable study by psychologists who utilize multiple regres- 
sion techniques. These papers are directed at the problem of inferring the magnitude 
of an effect in studies involving school and student characteristics and in choice of 
a change or growth variable as the dependent variable in regression analysis. In one 
paper they demonstrate that hypotheses of school effects can either be rejected or 
confirmed, depending on the method of analysis with a given set of data. These 
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findings, of course, have considerable generality beyond the school effects model 
they take as a paradigm. 

Conger (97) presents a unified treatment of the suppressor variable problem and 
introduces the notion of reciprocal suppression. Papers by Rutemiller & Bowers 
(425), Jacquez, Mather & Crawford (247), and Lancaster (310) take up the problem 
of heteroscedastic regression from different points of view. The first deals with a 
situation in which the error about regression is linear in the fixed predictors. A 
maximum likelihood technique is developed and compared with ordinary least 
squares in two sets of data. The maximum likelihood methods clearly perform 
better. In the second paper, no specification is imposed on the variance about 
regression, but it is assumed that several replicates of each value of a single predictor 
are available. Weighted least squares, maximum likelihood, and ordinary least 
squares are compared by Monte Carlo methods. It is recommended that weighted 
least squares be used whenever the number of replicates per cell exceeds the number 
of cells. Lancaster (310) deals with yet another approach in which the error about 
regression is a power function in a single predictor variable. He develops grouping 
estimators based on the mean values of dependent and independent variables in 
upper and lower sections of the range of the independent variable. 


STATISTICAL ISSUES Several other papers deal with statistical problems in regres- 
sion analysis. Gardner (168) explores the bias in variance about regression estimates 
when a regression equation from one sample is applied to another. He treats a 
variety of situations. King (287), in an excellent paper, deals with the question of 
what estimate of a regression coefficient is best in a new sample. He shows that a 
modification of the traditional weight is best over a large part of the sample space 
when sample size is small. Sclove (434) deals with a modified James-Stein estimator 
for regression weights. Aitkin (9) develops simultaneous fixed-width confidence 
intervals for a pair of regression lines. Davies (121) discusses the use of absolute 
deviations rather than squares. 


STEPWISEREGRESSION The technique of stepwise multiple regression analysis con- 
tinues to elicit a substantial degree of attention (133, 309, 333, 339, 418). Generally, 
stepwise multiple regression is becoming known as a method within the framework 
of data analysis, rather than strict statistical methodology (93). An excellent paper 
by Anscombe (16) describes the potential of an interactive computer-based stepwise 
system in which the investigator chooses the variable to be entered or deleted at each 
step. Several papers (159, 226, 309, 359) deal with the problem of examining all 
possible regressions for determination of the best possible subset of a given size. 
Beale, Kendall & Mann (36) provide a useful discussion, with examples, of this 
approach. Williams & Lindem (496, 497) describe an interesting procedure for what 
could be called setwise stepwise multiple regression in which subsets of variables 
play the role ordinarily taken by individual variates. 

Draper, Guttman & Kanemasu (130) show that the significance level for entry 
of a variable in stepwise analysis should be approximately pa, where p is the 
number of predictor variables instead of a. Their results were obtained assuming 
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mutual independence of the predictors but may provide a useful rule of thumb for 
the commonly encountered situation. 


MISSING DATA Several papers attack the problem of missing data in regression 
analysis. Afifi & Elashoff (2-5) consider the two-variable case with missing data on 
both X and Y variates. They discuss several methods of estimation and provide 
detailed tables and analyses of their behaviors under different conditions. Dagenais 
(115-118) considers problems that arise when predictor variable data only are 
missing. Guertin (197), using data from a large scale study, concludes that there is 
little gained by adopting any procedure other than that of simply ignoring missing 
computing intercorrelations from available data and then entering this correlation 
matrix into a multiple regression program. Kelejian (281) studies a similar problem. 


NONLINEAR REGRESSION Nonlinear regression problems have attracted attention 
recently. In this connection we include the use of moderator variables and piecewise 
regression. Moderator variables have been studied extensively by Zedeck (503-505) 
and Conger (96). Piecewise linear (and nonlinear) regression has been studied by 
Bogartz (62, 63), Wainer (482), and others (311, 343). Classical nonlinear regression 
problems and procedures have been studied extensively (28, 449). An excellent pair 
of papers by Sclove (435) and Hoel (229) merit careful reading by any investigator 
interested in nonlinear regression. Thornby (466) has developed an extremely useful 
distribution-free test of linear regression against a general class of concave alterna- 
tives. A related problem is that of estimating linear regression coefficients subject 
to restraints (i.e. positivity); Mantel (332), McGuire et al (344), and Lovell & 
Prescott (327) have explored this problem and proposed solutions. 


SINGLE DEPENDENT VARIABLE, MULTIPLE POPULATIONS 


Continuous Interval Data 


The problems we consider here deal with the reasonableness of assumptions and 
meaningfulness and appropriateness of statistical results. These problems have been 
the topics of numerous recent papers, but they appear primarily in statistics jour- 
nals, or are written in a mathematical jargon that makes them next to impossible 
to comprehend, let alone apply. However, there are a number of results that are 
readable, if you have a Berlitz-like phrase book for mathematics and realize that you 
don’t have to read the proofs or even the theorems in many of the articles. 
What are the problems? Problem A: The cost of collecting the data will constrain 
the completeness of any study. The more one gets involved in a study, the larger 
the number of factors in the design will become; and there has been some progress 
in the recent past in constructing designs that make the most out of a given amount 
of data. For example, careful consideration of the alternatives to the null hypothesis 
that are of most interest, and use of assumptions of zero higher level implications 
before experimentation, can reduce the size and number of samples. Problem B: The 
choice of the best testing procedure can make a great deal of difference in the 
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usefulness of data relative to theoretical questions. There have been a larger number 
of recent studies of the relative power and the robustness of competing test proce- 
dures and a much smaller number of creative proposals for new and better test 
procedures. Problem C: Badly behaved data, which violate some of the assumptions 
of the simplest testing procedures, seem to occur with a very high frequency, and 
a number of ways of legally fudging such data have been developed; for example, 
transforming the data to obtain normally distributed errors. In order to apply these 
procedures, all that is needed is a method of evaluating statistical significance for 
these new procedures. The work reviewed below is partitioned among these three 
problems, although some fudging is necessary due to the overlap among the prob- 
lems, and some subjective interpretation is used, such as the assumption that un- 
equal sample sizes constitute badly behaved data. 


PROBLEM A The problem is to decide the pattern of sampling to jointly minimize 
the cost of the experiment (by minimizing the total sample size) and maximize the 
precision of the conclusions reached, subject to the constraints of the assumptions 
made. For example, if one can assume that the interactions between some factors 
in an experiment are zero, one need only run a small subset of the groups that would 
otherwise be needed, or if one can tolerate less precision (larger variances of esti- 
mates of parameters or larger probabilities of wrong conclusions from tests), fewer 
replications within each experimental group are necessary. Cohen (94), Bowman 
(66, 67), Hodges & Lehmann (228), and Tiku (470) provide tables for the relations 
among sample size and the probabilities of Type I and Type II errors for standard 
t-tests and analyses of variance (ANOVA). 

There are three directions in which progress has been made recently in improving 
upon the standard designs. First, there is continuing work on randomized blocks 
designs (in which some or all of the factors are varied within experimental units, 
such as subjects). Second, there has been a great deal of work on incomplete blocks 
and fractional factorial designs (in which only a few of the combinations of the 
independent factors are run). Third, one can consider a sequential design in which 
a decision is made at various points during the experiment of whether to continue 
sampling or to stop and state the conclusions. For an introduction to designs of the 
first two types, there is an excellent book by Kirk (288) and reviews of recent 
advances by Addelman (1) and Herzberg & Cox (223). Sequential designs are 
thoroughly covered in De Grott’s book (125). 

In order to read many of the articles on experimental design, especially those 
dealing with unequal sample sizes (unbalanced designs, in general), one must 
become familiar with the notation of the design matrix and the concept of the 
generalized inverse of a matrix, because most authors have switched to this notation 
because it is much more compact than the usual summation notation. Briefly, in 
these terms, the data (Y) are an N X 1 vector and are assumed to be linearly related 
to an M X 1 vector of parameters (8) by the equation Y = X9 + E, where X is an 
N X M matrix of ones and zeros indicating the contributions of the parameters to 
the data and E is a vector of random errors that are subject to the usual assumptions. 
Now SIN would yield a vector of estimates of the parameters, but unless M = N 


458 MEREDITH, FREDERIKSEN & McLAUGHLIN 


and X has full rank, X~! is not defined. Therefore, the generalized inverse of X OC 
X)-!X', has been developed and used extensively. See, for example, Searle (436) for 
this concept. 


Problem A, solution 1 Randomized blocks designs improve the precision of the 
estimation of parameters at the cost of requiring that one assume that the correla- 
tional structure of the observations within units is simple. ANOVA for single subject 
designs is set forth in articles by Shine (445, 446), and the rationale for such 
randomized and block designs, in general, is discussed by Youden (502) and Box 
& Guttman (68). Pearce (377, 378) investigates the relative efficiency of blocks 
designs, and Calinski (81) describes some desirable patterns in block designs. Koch 
(291) and Rees (410) present analyses of classes of randomized blocks designs called 
“split plot” designs, and Naruboodiri (368) describes some “changeover” designs. 

One problem with repeated measures within a unit is to minimize undesirable 
sources of variance. Atkinson (23) gives a correction for the effects of measures on 
adjacent measures, and Phillips (381) gives tables for order of observations which 
minimize the linear trend effect across repeated measurements. 


Problem A, solution 2 Incomplete and fractional designs purposely confound 
different sources of variance in order to reduce the size of the experiment. In order 
to use these designs, the experimenter assumes that certain higher order interactions 
are zero to eliminate the confounding. This allows one to investigate a large number 
of factors in an experiment that is not prohibitively large. For example, if one is 
willing to assume that all third-order and higher interactions are zero but wants to 
estimate all main effects and second-order interactions (called a Resolution V plan), 
he can investigate 10 two-level factors with 56 experimental runs, instead of 1024 
runs. Addelman (1) reviews most of the recent work on fractional factorial designs. 
An alternative method of choosing a factorial design, the |X' X| criterion, is described 
by Box (70); and a method for expressing experimental contrasts in factorial designs 
as linear combinations of a set of basis contrasts is developed by Das & Jain (120). 

Latin squares are the best known type of confounded designs and are covered by 
Kirk (288). Pearce (375) points out several variations on Latin squares that may be 
of interest if you wish, for example, to test several treatments against a control. 

There have been several papers (Pearce 376, Clatworthy 88, Afonja 6, Basson 34, 
Cunningham & Henderson 110, Robinson 417, and John, Wolock & David 252) on 
incomplete block designs; that is, designs in which each block receives only a subset 
of the treatments. These papers are concerned with procedures for generating the 
combinations of treatments and blocks, with procedures for estimating parameters 
in such designs, and with the relative efficiencies of the designs. 

Finally, two recent papers have investigated the consequences of using nested 
designs in which different numbers of levels of the nested factors appear within each 
level of the higher stage factors, unbalanced nested designs. Leone et al (313) present 
examples of the method for analyzing these designs; and Goldsmith & Gaylor (184) 
generate and evaluate the relative efficiency of 61 different three-stage unbalanced 
designs. 
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Problem A, solution 3 Sequential sampling procedures are characterized by the 
performance of an intermediate statistical test on a subset of the potential data to 
be collected to determine whether it is worthwhile to collect more data. Therefore, 
they include the cost of collecting the data as a relevant term in the intermediate 
statistical test, which must be balanced against the cost of making an error in the 
conclusions of the study. In most cases, the form of the intermediate statistical tests 
is analogous to standard tests, although the interpretation is different. A thorough 
understanding of these procedures depends on an understanding of stochastic pro- 
cesses in general, as presented, for example, in De Groot’s book on optimal statisti- 
cal decisions (125). However, several recent articles have spelled out specific 
procedures that can be applied without a great deal of sophistication in stochastic 
processes. 

For a fairly straightforward problem, the estimation of a population mean, two 
recent studies propose two-stage procedures. Arnold & Al-Bayyati (18) considered 
the problem of estimating the mean when a prior estimate is available. If the sample 
mean of a first sample is sufficiently far from the prior estimate of the population 
mean, a second sample is collected and the pooled mean used as the estimate. This 
procedure 1s shown to compare favorably with the usual procedure in efficiency. In 
a different case, where two populations are known to have a common mean but 
differing variances, Alam (10) developed a procedure which consists of initially 
balanced sampling from the two populations, and after determining which has the 
smaller variance, continued sampling from the population with the smaller variance. 
In particular, he solves the problem of determining how large the initial samples 
should be when the total of all the samples is fixed. 

For the Behrens-Fisher problem, deciding whether two population means differ 
when the variances are different, Robbins et al (415) developed a sequential proce- 
dure which holds the standard deviations of the sample means from the two popula- 
tions approximately equal. A nonparametric procedure for testing equality of 
location of two samples, a sequential signed rank test, was developed by Miller (355). 

For the comparison of more than two populations, Ghosh (175) has developed 
a sequential procedure for various ANOVA designs; and Bechhofer et al (39) have 
developed sequential procedures for choosing the best (highest mean) of several 
populations with unknown variance. 

Finally, before leaving Problem A, we mention two contributions to the sampling 
problem by Sedransk. In one study (439) he describes a sampling scheme for 
multifactor designs when the error variance is known, based on criteria of prescribed 
precision of estimates; and in another paper (440) he investigates the problems 
involved in using systematic sampling as opposed to random sampling. 


PROBLEMB Most of the studies reviewed in the section on X-group univariate data 
analysis were concerned in one way or another with the choice of estimation and 
hypothesis testing procedures, developing new procedures, extending procedures, 
and evaluating the relative efficiencies of alternative procedures. The type of proce- 
dure you will use depends on whether you are interested in measures of location, 
or measures of dispersion, or the shape of a distribution. 
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The recent work in this area falls roughly into 11 categories, most of which are 
related to ANOVA, and we shall consider these categories sequentially. 


Preliminary tests A preliminary test procedure is a statistical test whose outcome 
determines which of alternative tests of a main hypothesis will be used. Kale & 
Bancroft (279), Han & Bancroft (210), and Arnold (17) have investigated proce- 
dures for estimating the mean of a population which use the sample mean along with 
the mean of a sample from a second population if the two sample means are not 
too far apart according to a preliminary test. Mehta & Gurland (351) and Davis & 
Arnold (123) present analogous procedures for the estimation of a population 
variance, using information from a sample from a second population. 


Testing equality of variances Gartside (169) compared empirically seven different 
methods of testing for homogeneity of variance in X groups and made recommenda- 
tions based on relative power that were based partly on what kind of departure from 
homoscedasticity one expects. Generally, Bartiett’s statistic compared favorably 
with the others. Sugiura & Nagao (461) developed mathematical results for Bart- 
lett’s test and compared it with Lehmann’s asymptotically UMP invariant test. Burr 
& Foster (79) presented yet another test for equality of variances; and Han (209) 
has developed a multiple correlation test for use in two-factor designs. Bechhofer 
(38) has developed a method of ranking the variances of a set of populations to be 
used, for example, to determine which population has the smallest variance for 
further sampling. 


Testing normality The second of the two main assumptions of the ANOVA model 
is that the errors are not only distributed with equal variances but are normally 
distributed. Shapiro, Wilk & Chen (443) compared 9 tests for normality empirically 
for 45 alternative distributions and recommended use of a statistic developed previ- 
ously by Shapiro and Wilk. Wilk & Shapiro (495) extended the use of this statistic 
to the joint assessment of normality of several independent samples. D’Agostino & 
Cureton (119) presented a test for normality that is powerful against skewness. 


Robustness Robustness of a procedure refers to the invariance of the outcome of 
the procedure with respect to variations in the true error distribution from that 
assumed in developing the procedure. In ANOVA, the variations are from homo- 
scedasticity, normality, or independence of the errors. If you question these assump- 
tions for your particular data, it is recommended that you use either nonparametric 
procedures or a function to transform your data prior to analysis. ANOVA has a 
reputation for being robust, although Bradley (71) has attacked this reputation in 
a large empirical study which found that when assumptions were violated, the 
significance levels became dependent on such factors as sample size and correlation 
of estimates of the mean and variance. For departures from normality, Arvesen (19) 
applied Tukey’s jackknife method to provide a more robust ANOVA; and for 
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departures from homogeneity of variances and covariances in repeated measures 
designs, Collier et al (95) presented an adjusted F-test which improved upon the 
standard procedure. 

A particular deviation from normality has received continued attention recently: 
the case of (very) discrete measures. Hsu & Feldt (237) investigated the effect on 
the one-way F-test of using from two- to five-point scales for criterion scores and 
found only small differences in the test outcome from continuous scales. In 2-way 
designs, Gabrielsson & Seeger (163) found similar results with dichotomous data. 
In case you are still worried about the problem, McNeil (347) has furnished results 
of exploring consistent statistics in this case, using inverses of the assumed “discre- 
tizing” or grouping function. 


ANOVA procedures There have been several recent papers on methods for per- 
forming the analysis. Most of them are on the difficult problem of estimation of 
variance components in experiments with unequal n’s and will be mentioned under 
Problem C. A Fortran program for ANOVA is given in (80) and a PL/I program 
is described in (164). Cunningham (109) describes an algorithm for unusually large 
designs in ANOVA and analysis of covariance (ANOCOVA). Jones (257) gives a 
procedure for use with stratified subsamples. Pinneau & Ender (385) have shown 
the relationship of univariate ANOVA to principal component analysis of the 
correlations between treatment levels and observations, and Mandel (330) has given 
a method of analyzing interactions in ANOVA designs in terms of principal compo- 
nents. Hogg (230) describes a more powerful ANOVA procedure for use when the 
alternatives to the null hypothesis are restricted. To aid in the planning of the 
analysis of designed experiments, Anderson (14) suggests the conceptual use of a 
“restriction” source of variance which has no degrees of freedom or associated sum 
of squares but serves to clarify the analysis of experiments in which randomization 
is restricted, as in randomized blocks designs. When the analysis problem is simply 
to choose a subset of the treatments which contains the treatment with the smallest 
population mean, a procedure described by Desu & Sobel (127) can be used. 

Measures of the magnitude of experimental effects are the topics of papers by 
Fleiss (149) and by Friedman (155, 156). Estimation of the components of variance 
in the random effects model, even when sample sizes are equal, is subject to a defect 
that is the topic of papers by Wang (486) and Federer (143): that the estimates may 
be negative, although the population values are by definition non-negative. Federer 
gave an improvement over the procedure of merely substituting zero for a negative 
estimate, and Wang compared the relative efficiencies of five different estimation 
procedures. 


ANOCOVA A Fortran program for analysis of covariance is described by Walsh 
(484). An article by Reisch & Webster (411) points out that ANOCOVA has uneven 
power with respect to different alternative hypotheses and suggests a procedure to 
reduce that unevenness. Problems in applying and interpreting the results of 
ANOCOVA when the covariates are uncontrolled and may possibly be related to 
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experimental groups are discussed by Werts (493, 494), Mallios (329), and Sprott 
(455). The use of ANOCOVA for the analysis of paired comparison data is set forth 
by Bechtel (40). 


Residual error Winer (499) discussed several aspects of the concept of error in 
statistical models, and Cox & Snell (105, 106) and Andrews (15) investigated the 
general definition of residuals and proposed statistical tests based on residuals. 
Johnson & Graybill (255) present a method for estimating error variance in a 
two-way design with interaction, based on the pattern of 2 X 2 contrasts; and 
Gnanadesikan & Wilk (181) present a maximum likelihood estimation procedure 
for the error variance in complex designs. Hill (225) broadens the one-way ANOVA 
procedure to allow for negatively correlated residuals. 


Multiple comparisons When more than two groups are present in an experiment, 
the investigator is usually interested not only in whether there is a significant main 
effect but also in which groups differ significantly from each other. The problem of 
evaluating this latter significance is nontrivial because of the multitude of tests being 
simultaneously performed on the same data, and there have been many suggested 
procedures. Games (166) and Aitkin (7) have recently surveyed alternative proce- 
dures. Rodger (419, 420) has proposed an alternative version of Scheffe’s procedure. 
Smith (450) has recently shown that Tukey’s “honest significant difference” proce- 
dure is robust with respect to the requirement of equal group sample sizes. Bech- 
hofer (37) described a multiple comparison procedure for group variances, related 
to his earlier paper (38). 


Problems of inference Comparisons of alternative methods of combining the sig- 
nificance levels of independent test statistics have been conducted by van Zwet & 
Oosterhoff (479) and by Littell & Folks (322). Although there is no uniformly best 
procedure, Fisher’s method compares favorably with its competitors. Concerning 
the question of experimenter bias, Rosenthal (421) criticized a review by Barber & 
Silver (26) which indicated that experimenter bias is not a frequent or strong effect. 
Levin & Marascuilo (315) discuss the problem of Type IV errors, that is, making 
the wrong conclusion after performing an admissible data analysis. 


Likelihood and Bayes procedures Comparison of the likelihood functions for data 
under competing hypothetical models is a common method of developing a statisti- 
cal test, and the large sample approximation to the logarithm of the likelihood ratio 
is well known. Sprott & Kalbfleisch (456) point out that with computers, however, 
one can frequently avoid using the large sample approximation and that for some 
examples the approximation is not a good one. 

An introduction to Bayesian procedures is given in De Groot’s book (125). A few 
authors recently applied Bayesian procedures, the alteration of a priori hypotheses 
as a result of the ration of likelihood functions, to ANOVA designs. Box & Tiao 
(69) apply Bayesian estimation to the one-way random effects model; Farrell (142) 
considers two-way designs; Tiao & Box (468) consider nested designs; and Tiao & 
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Draper (469) apply Bayesian analysis to balanced incomplete blocks designs. Other 
recent papers on Bayesian procedures are by Ericson (141) on estimation of a 
population mean; by Hudimoto (240) on an empirical Bayes approach to multiple, 
repeated, decision problems; by Pitz (386) comparing Bayesian and standard proce- 
dures for a perception experiment; and by Maritz (337) on an empirical Bayesian 
approach to the compound decision problem. 


Other estimation and testing procedures Seven papers reviewed do not fit into any 
of the categories above but are worth mention. Blumenthal (56) has compared five 
procedures for estimating the larger of two population means. Gardner (167) devel- 
oped a procedure for detecting whether changes occur in the mean of a sequence 
of n observations. Salsburg (426) demonstrated the use of the jackknife procedure 
for testing proportions near zero or one. For testing whether a set of binomial, 
multinomial, or Poisson distributions have homogeneous parameters, Potthoff & 
Whittinghill (390, 391) have described testing procedures. Slakter (447) empirically 
investigated an estimate of the power of the chi-square goodness-of-fit test. Finally, 
Juola (272) has developed a criterion for deciding whether a population is symmetri- 
cally distributed; a continuous random variable X is symmetric if and only if 
prix < 2/X = 1/2) for all t > 0. 


PROBLEMC In this section we consider procedures for handling data which do not 
fit into more standard models because of the violation of basic assumptions. There 
are two solutions to the problem: to transform the data to a form that does satisfy 
the assumptions, or to develop a method for handling the data in their unpleasant 
form. We first consider papers dealing with the first solution and then papers dealing 
with the analyses of data which are heteroscedastic or have unequal sample sizes. 


Transforming to normality Kowalski & Tarter (295) demonstrated that trans- 
forming to normally distributed scores improved the test for correlation and dis- 
cussed general methods for performing the transformation. Riffenburgh (413) 
considered the problems involved in transforming a bounded random variable to a 
normal, and therefore unbounded, variable. Draper & Cox (129) consider a power 
transformation to achieve normality. A Fortran program for the direct transforma- 
tion to normality by transforming to ranks and then from the ranks to normal 
quantiles, is described by Astin et al (22). 


Transforming to minimize sources of variance Bogartz & Wackwitz (64), Draper 
& Hunter (131), and Fraser (153) have developed transformation procedures for 
fitting simpler forms of the linear model; and in related work, Levine (316, 317) has 
explored transformations that render curves parallel or of the same shape. Schlessel- 
man, in a thesis under Tukey (429), has considered transformations for two-way 
ANOVA which simultaneously reduce interaction and heterogeneity of variance. 


Filling in missing data Preece (392) has recently discussed iterative procedures for 
filling in missing values in such a way that the estimate of error variance is mini- 
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mized in factorial designs. Walsh (483) describes a Fortran procedure for filling in 
missing data in performing ANOVA. 


Heteroscedasticity The problem of testing whether the means of two populations 
differ when the variances differ is known as the Behrens-Fisher problem. Use of the 
Studente 7 statistic in this case leads to erroneous error levels. However, Bhattachar- 
jee (50) and Mehta & Srinivasan (352) have considered approximations to the £ 
distribution which improve the test, as has Patil (374) for the three group case. 
Scheffé (428) compares six different solutions to the problem, and Eaton (136) 
develops a multivariate version of a solution suggested by Scheffé. 

The problem of heteroscedasticity in regression and covariance analysis has been 
the target of procedures developed by Ashford & Brown (20), Bain (25), and 
McLaughlin (346). 


Unequal sample sizes The estimation of variance components in ANOVA when 
the sample sizes are unequal has been attacked recently in at least three different 
ways. Searle (438) discusses many of the problems involved. Hartley & Rao (217) 
and Cunningham & Henderson (111) have developed maximum likelihood proce- 
dures for the mixed model, which has been extended to nested designs; for example, 
by Searle (437) and Rudan & Searle (424). Afonja (6) gives minimal sufficient 
statistics for variance components in randomized blocks and balanced incomplete 
blocks designs. A least-squares estimation procedure is described by Harvey (218). 
Rao (406), Rao & Subrahmaniam (409), and Rao (407) have described a different 
criterion for estimation, the MINQUE or minimum normed quadratic unbiased 
estimator, for linear models. The MINQUE procedure appears to be the easiest and 
most efficient of those methods so far developed. 

Milliken & Graybill (356, 357) have extended the general linear model and 
applied it, for example, to testing for interaction in cases of missing data, Koch (290) 
and Low (328) have developed a procedure for estimation of the population mean 
in random effects models by minimizing a function that is sensitive to shifts in 
location. 

In some cases, the sample size can be treated as a random variable which may 
be related to the effect of the treatment. Bartholomew (32) has treated the problem 
of hypothesis testing when the sample size is random in general, and Harville (219, 
220) has considered the problem for one-way and two-way designs in ANOVA. 


Ordinal Data 


Nonparametric statistics, or rank order statistics, are estimation and testing proce- 
dures that are characterized by a transformation of the data to ranks or to signums, 
indicators of the sign of a difference. The pattern of the transformed data is then 
either compared to chance distributions or is transformed for testing to scores of 
some known distribution, such as the normal distribution. How can one choose 
between a standard procedure and a nonparametric procedure in a given situation? 
The primary criterion is whether the data one has, or the theory that predicts data 
to be observed, deviates from the assumptions of the standard procedure. ANOVA 
assumes normally distributed error, and deviations from normality can, for example, 
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lead to incorrect Type I error levels. It is well known, on the other hand, that use 
of a nonparametric method when the standard procedure is appropriate, loses some 
power or efficiency. The recurring conclusion from recent studies is, however, that 
for many nonparametric methods, this loss is very small, and, in some cases, nonex- 
istent. A secondary criterion is ease of computation, which especially favors the sign 
test as a method of quick, offhand, data analysis. The studies we consider here fall 
into three categories: estimation of location and dispersion, two-sample tests, and 
k-sample tests, for k greater than two. 


ESTIMATION OF LOCATION AND DISPERSION If a choice is to be made of which of 
a family of distributions F;(x) best fits a set of data, and the distribution functions 
are related by the equation F;(x) = F(x -0), then 0 is a location parameter, to be 
estimated. If the equation is F;(x) = F,(x/@), then @ is a dispersion parameter. 
Hollander (232) has considered nonparametric estimates of each type and the rela- 
tions between them, such as correlation. In general, estimates of location will have 
small correlations, if any, with estimates of dispersion. Tamura (463) has investi- 
gated the properties of a two-stage estimation procedure using the median, analo- 
gous to those mentioned above in the section on preliminary tests. Yanagawa (501) 
has developed an estimator of location in the form of a mean of medians and shown 
evidence of its robustness. 

Estimates for scale or dispersion parameters have been developed by Kakeshita 
& Yanagawa (278) and by Prescott (394, 395). Mood’s squared ranks test statistic 
has been refined by Mielke (353), who gave a correction for ties. 


TWO-SAMPLE TEST PROCEDURES A fine but selective review of recent advances in 
this area is given by Savage (427), including nonparametric Bayesian procedures. 
A survey of sign tests is given by Bradley (72), and McNeil (349) has developed 
two-stage testing procedures, the first stage consisting of a kind of sign test which 
is easier to perform than more exact tests. Concerning the Mann-Whitney statistic, 
Buckle (77) have given a better large sample approximation than the normal approx- 
imation, based on the sum of uniform variables; and Hollander (231) has given two 
competitors which compare favorably in efficiency, unless the alternative hypothesis 
is a pure shift in location. Cureton (112) provided an appropriate correction for ties 
in the normal approximation to the Wilcoxon signed rank statistic. A comparison 
between the sign test, the signed rank test, and Fisher’s randomization test was 
carried out by Kempthorne & Doerflor (283), who prefer the randomization test. 
Edgington (137, 138) has proposed an alternative to exact randomization tests, 
which suffer mainly from requirements of excessive computation: one can randomly 
sample among the randomization patterns. McNeil (348) investigated the loss of 
efficiency in several of these procedures when the data are grouped into class 
intervals, found it to be relatively small, but recommends that one break ties by 
randomization rather than averaging ranks. 

The asymptotic efficiency of the two-sample Kolmogorov-Smirnov test was inves- 
tigated by Klotz (289) who found it insensitive to differences of scale; and Steck 
(458) has derived distributions for this test, characterizing it as a rank test. Bell et 
al (41, 42, 44) have developed test statistics based on normal order statistics, either 
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using tables of the expected values of these statistics or drawing a random sample 
from a distribution known to be normal, and have compared this procedure with 
other nonparametric tests. Birnbaum (51) considered the problem of evaluating the 
robust efficiency of two-sample rank tests. A variation of the median test has been 
developed by Gastwirth (170); and Gibbons & Gastwirth (176) have developed rank 
tests based on only the extreme ranks of the two samples. 


MORE-THAN-TWO-SAMPLE TEST PROCEDURES Gilbert (177) has recently conducted 
an empirical study of the relative power of three alternative rank tests for an 
ANOVA design. Doksum (128) and Spjgtvoll (454) provide rank procedures for 
estimation and testing in some designs, Crouse (108) analyzes a 2” factorial design 
with rank sum statistics, and Koch (292) explores rank procedures for complex split 
plot experiments. 

Jonckheere’s X-sample test against ordered alternatives has been investigated by 
Odeh (371, 372) and May & Konkin (338), who provide tables of significance and 
power of the test. Odeh (370) and Conover (99) have developed other rank proce- 
dures for determining whether & groups have identical location. Rizvi (414) have 
described ranking procedures for comparing several groups with a control, and 
Barlow & Gupta (29) have developed procedures for the selection of a subset of 
treatments containing the “best” treatment. 

Quade (402) has developed a rank ANOCOVA, for which Labinowich & Brewer 
(304) have provided a Fortran program. Puri & Sen (400) have generalized and 
refined this analysis. 


CENSORED DATA Censored data are data which are allowed to take on only a 
bounded set of values, although the underlying distribution process may be un- 
bounded. The censoring may occur because of some process inherent in the data, 
or the experiment may trim the sample to avoid erroneous conclusions based on 
“outliers” that contaminate the data. 

Prescott (393) has recently given an estimator of the standard deviation of a 
normal population based on censored samples. Hettmansperger (224) has investi- 
gated the Mann-Whitney statistic for trimmed samples, and Sobel (452) has general- 
ized the Wilcoxon rank sum test to censored samples. Another generalization of the 
Wilcoxon test was developed by Gehan and refined by Mantel (331) and Gehan & 
Thomas (173). Breslow (73) has extended Gehan’s test to more than two censored 
samples. 


MULTIPLE COMPARISONS Nonparametric multiple comparison procedures have 
been developed by Tobach et al (472), Slivka (448), Rhyne & Steel (412), and 
Roberge (416). Slivka’s procedure is for comparison of treatments versus a control, 
and Roberge’s includes the description of a Fortran program for the analysis. 


MULTIPLE DEPENDENT VARIABLES, SINGLE POPULATION 


This section will review work related to the analysis of multivariate data obtained 
from a single population of subjects. In this section we will consider models for the 
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analysis of continuous, ordinal, and categorical (nominal) data, and, with the excep- 
tion of models for the analysis of categorical data which deal with interaction and 
independence in multidimensional contingency tables, this section will not consider 
group differences. Work on multivariate analysis of group differences is reviewed in 
the next section. We will consider first the analysis of continuous multivariate data, 
followed by ordinal and then categorical data. For each type of data, papers will 
be reviewed under the topical headings: structural models, regression, estimation, 
hypothesis testing, and distribution theory. 


Continuous Interval Data 


LINEAR STRUCTURAL MODELS: COVARIANCE STRUCTURES Suppose that we have 
obtained a set of p measurements X), . . „X,on a random sample of n + 1 individu- 
als drawn from a population in which we assume that the jth variate G=1,..., 
p) is distributed normally with mean p; and variance Sei In addition, we assume 
the joint distribution of the p variates is multivariate normal with mean vector 
Wes (His » - Wel and covariance matrix Z = (oy) (a prime indicates the transpose 
of a column vector, oj = oi and o; = cov(X, AE Let x' = (x,,..., Xp) bea 
vector of observed scores obtained from a single randomly sampled individual. A 
linear structural model represents this vector of observed scores as given by linear 
regression on a set of latent (unobservable) random variables plus an additive unique 
(residual or error) component representing that portion of the observed scores which 
is not predictable by linear regression from the latent variables. Thus, if v is a vector 
of scores of a randomly selected individual on r latent variables, z is a p-element 
vector of residual scores uncorrelated with the latent variables, and A(p x r) is a 
matrix of regression weights; the linear structural model is given by the matrix 
equation x = u + Ay + z. The estimation problem involves finding a linear 
structural model which fits the observed scores obtained from a given sample of 
people and inferring the likelihood that the structural model is valid in the popula- 
tion from which the sample of subjects was drawn. 

Since in the linear structural model just described one can replace A with AT and 
y with T-!y (where T is any nonsingular r x r matrix) and not affect the residual 
vector z, in order to obtain a unique set of parameter estimates for the structural 
model it is necessary to impose additional restrictions on the model. Thus, in the 
model just given, it is minimally necessary to specify a measurement scale for the 
latent variables (e.g. by specifying unit variance) and to uniquely identify the ele- 
ments of A by requiring that the latent variables be uncorrelated (or have fixed 
correlations), by setting certain of the elements of each column of A equal to 
specified values (e.g. zero), or by specifying a unique “rotation” for the columns of 
N. In general, the purpose is to make a sufficient number of assumptions so that each 
parameter which remains to be estimated is uniquely specified by the model (i.e. can 
have only one value). Such a model is said to be identified. Dunn (134) has stated 
sufficiency conditions for uniqueness of a restricted factor matrix A in the above 
model, and Werts, Jöreskog & Linn (489) have discussed the identification problem 
for certain ordered-variate models. 


468 MEREDITH, FREDERIKSEN & McLAUGHLIN 


Any linear structural model which does not constrain the mean vector can be 
expressed in terms of the covariance matrix of the observed variables. Thus, if (using 
what has become standard notation) P represents an r x r symmetric matrix of 
covariances of the r latent variables y of the above structural model, and Y repre- 
sents a p x p diagonal matrix containing the variances of the residual variates z on 
its main diagonal, the model can be expressed in terms of the covariance matrix as 
follows: 
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The covariance structure form of a linear structural model is especially convenient 
since all parameters of the model are represented in the model equation. Thus, linear 
structural models are often expressed as models for the structure of a covariance 
(or correlation) matrix. 

Jöreskog (260) has presented a general model for covariance structures which 
includes Model 1 as a special case. Jöreskog’s model assumes that the population 
varlance-covariance matrix has the form 


2 = B (AGA' + Y3 B' +8 2. 


where B = (x) is a p x q matrix of regression weights, A = (A,,,) 18 a g x r matrix 
of regression weights, ® is as defined previously, and the diagonal matrices Y? = 
(Uv?) (q x q) and 8? = (87) (px p) contain unique variances. Note that by setting 
B =I, a p x p identity matrix, and 8° = (0), a matrix of zeros, the model is 
equivalent to No. 1 with g = p. As in Model 1, the mean vector is assumed to be 
unconstrained so that all relevant information from the sample is contained in the 
sample variance-covariance (or correlation) matrix. In the review of work on linear 
structural models which follows, three principal classes of structural models will be 
identified which, in most cases, can be represented as special cases of Jöreskog’s 
general Model 2 but which differ in the particular assumptions that are made 
concerning which elements of the parameter matrices are free (unconstrained), 
constrained (to be equivalent to other elements), or fixed (to specified values). The 
three classes of structural models are as follows: (2) reduced rank models which 
specify r < p in Model 1, ® = I (an r x r identity matrix), and fix the columns 
of A by requiring either that A'V?A be diagonal or that A'A be diagonal; (b) 
classificatory models which may or may not reduce rank and in which a large 
number of elements of A (in Model 1) or A and B (in Model 2) are fixed and set 
equal to zero; and (c) ordered variate models in which parameters in Model 2 are 
fixed in such a way as to impose a stochastic order (or partial order) on the observed 
variables such that one variate is derivable from another by linear regression in an 
ordered sequence. Reduced rank models include test theory models (r = 1), and the 
familiar unrestricted factor analysis (UFA) model (1< r <p). Classificatory models 
include cluster analysis methods and restricted factor analysis (RFA) models in 
which a factor structure is specified by setting factor loadings (in A) equal to zero. 
Variance and covariance components models in which (e.g. in Model 1) A is a 
completely fixed design matrix containing zeros and ones have also been considered 
in which the observed variables are viewed as classified according to an experimental 
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design with random effects. Ordered variate models include simplex models [for 
longitudinal growth or learning data, (cf. 260, 262) and path models (cf. 55, 492)] 
which can be fitted using general Model 2; and also include continuous state stochas- 
tic process models (e.g. certain time series models) and path models which require 
a structural model which is still more general than Model 2. Mixtures of these three 
classes of linear structural models have also been considered. 


Reduced rank models Here we consider special cases of Model 1 in which r < o 
but in which no elements of A are restricted to be zero. This model is the traditional 
unrestricted factor analysis (UFA) model which is appropriate in situations in which 
one either is concerned solely with dimensionality (i.e. the number of factors) or 
wants to examine the configuration of test vectors in common factor space by 
attempting to transform the estimated factor matrix A according to some criterion 
for factor rotation, either analytic (cf. 203, 273, 275-277 for recent work concerned 
with analytic rotation), subjective (graphical rotation to simple structure), or com- 
bining analytic with judgmental criteria (389). For purposes of identification and 
estimation under the UFA model, it is usually required that the factors be uncor- 
related (i.e. that P = I) and that either A'¥-2A be diagonal (the canonical factor 
characterization) (403) or A'A be diagonal (the principal factor characterization). 
In the former case, the columns of Y-A are the r characteristic vectors of 
\Y-ISıp-! (which has rank r and in which S is the sample covariance matrix); in the 
latter case, the columns of A are the r principal components of S — Y?. For purposes 
of obtaining maximumlikelihood (ML) estimates, the former assumption is conven- 
ient. Bentler (46) has provided a rationale for the canonical factor model in terms 
of psychometric sampling by defining a coefficient of internal consistency for an 
observed score composite and then showing that if weights are found which maxi- 
mize this coefficient, the composites thus defined correspond to factors defined by 
the canonical factor model. 

Jöreskog (259) has provided us with an efficient computational procedure for 
obtaining ML estimates for the parameters of the UFA model. Using the fact that 
under the canonical factor characterization for fixed r the conditional likelihood 
function for given Y? can be written explicitly in terms of the p-r smallest character- 
istic roots of deit, Jöreskog directly determines those values of Y? which make 
the conditional likelihood function a maximum using the Fletcher-Powell al- 
gorithm. The algorithm converges rapidly and the maximum likelihood solution can 
be determined very accurately. Jöreskog found that improper solutions (negative 
elements of Y?) occur rather frequently and proposed a general procedure for 
dealing with such cases. Jöreskog’s procedure also provides asymptotic standard 
error of estimate and confidence intervals for all estimated parameters. Browne (75) 
has published an extensive study comparing a number of approximate factor ana- 
lytic techniques with the ML solution for artificial data. Browne obtained ML 
estimates using the “maximum determinant” procedure suggested by Bargmann 
(27) and Howe (236). His approximate techniques included principal components 
with diagonal elements of the sample correlation matrix replaced by squared multi- 
ple correlations, the iterated principle factor method, and the centroid method. 


470 MEREDITH, FREDERIKSEN & McLAUGHLIN 


Browne concluded that ML estimates are superior and should not be rejected for 
reasons of computation time. Browne also found that the ML procedure for deter- 
mining the number of factors—-successive significance tests with different values of 
r—was satisfactory. However, Tucker & Lewis (477) have observed that for a large 
sample of individuals, the likelihood ratio statistic as a criterion for determining the 
number of factors could indicate that an otherwise acceptable model does not 
reproduce the sample covariance matrix. They proposed a reliability coefficient 
which represents a ratio of the amount of variance associated with a model to total 
variance as an index of quality of representation of the sample covariances. 

Jöreskog & Goldberger (268) have proposed that generalized least squares (GLS) 
be considered as an alternative statistical estimation procedure which provides 
estimates that are scale free, asymptotically efficient, and (if vn (S -}) is asymptoti- 
cally multinormal) have the same asymptotic properties as ML estimates (viz. 
limiting multinormal distribution of parameter estimates, the same asymptotic vari- 
ance covariance matrix, and the same asymptotic chi-square test). They proposed 
that the following three criteria be minimized: unweighted least-squares (ULS), 
Ytr[(S-)?]; weighted (generalized) least squares (GLS), “tr[(S-!X—I)?]; and max- 
imum likelihood (ML), tr(2:-'S) — In-!S| -p. Jöreskog & van Thillo (270) have 
programmed rapid algorithms using all three criteria. Each of these iterative proce- 
dures requires that initial estimates of the elements of Y? be provided. Kaiser (274) 
has suggested that when a correlation matrix is analyzed, initially should be set 
equal to an identity matrix since the number of positive characteristic roots on the 
first iteration would then correspond to Guttman’s (199) weaker lower bound for 
the number of common factors. 

The special case of the UFA model in which r = 1 is of particular interest as a 
general model within which various test theory models can be expressed. In particu- 
lar, four structural models have been described by Jöreskog (265, 266) for the 
covariance matrix of a set of congeneric tests (i. e. tests which are assumed to 
measure the same thing). The weakest model (i.e. the model which makes the fewest 
assumptions), the congeneric true score model, assumes only that r = 1 in Model 
l and that the single latent “true score” has unit variance. This model implies that 
all tests are derived by linear regression from a single latent “true score” and does 
not require homogeneous regression or error variance. The strongest model consid- 
ered is the parallel test model of classical true score theory (325) which requires that 
the elements of the column vector A be constrained to be equal and that the elements 
of Y? also be constrained to be equal (equal error or residual variances). Two 
intermediate models can also be considered: a model for tau-equivalent tests (325) 
in which the equal residual variance assumption is dropped from the classical model; 
and a model due to Kristof (298) for tests which differ only in length. The parameter 
estimation program ACOVS (269) for the general Model 2 can be used to obtain 
ML estimates of the parameters of each model and goodness-of-fit test performed. 
The problem of instrument calibration by estimating pairwise linear structural 
relationships among a set of p measures which is encountered in biometric work (30, 
31) represents a special case of the congeneric test theory model, and thus parameter 
estimates could be obtained using ACOVS. 
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Schönemann (432) and Schönemann & Wang (433) have given renewed attention 
to the problem of indeterminacy of factor scores previously discussed by Guttman 
(200). The indeterminacy issue (not to be confused with identifiability problems 
associated with factor rotation) involves the fact that there exists more than one 
vector y of factor scores for each person which satisfies Model 1 for a given set of 
parameter values. Schöneman discussed relations between the characteristic values 
obtained in ML estimation and the indeterminacy problem, and he derived bounds 
for the minimum average correlation between equivalent sets of scores on correlated 
factors which depend on the characteristic roots of the factor correlation matrix 
$. He argued against the practice of “estimating” factor scores and suggested that 
factor analysis be employed solely as a procedure for investigating the structure of 
covariance matrices. Tucker (476), on the other hand, has examined “external 
characteristics” of four methods of estimating (or defining) factor scores, i.e. rela- 
tions of these methods to measures not included in the factor analysis. The four 
methods examined were: least squares, Bartlett’s (33) method, regression estimates, 
and Anderson & Rubin’s (13) proposed method for uncorrelated factors. Tucker 
concluded that none of the estimates has correlations with external measures equal 
to the “theoretic” values implied by the model, and he offered specific recommenda- 
tions for choosing factor score estimates for use in regression studies and in studies 
of group differences. 

McDonald (342) has been interested in developing a generalization of the UFA 
model to two or more groups of variables. The idea is, following Tucker (475), to 
determine factors common to two groups of variables as well as factors specific to 
a single group of variables. McDonald generalized the UFA model by relaxing the 
requirement that the matrix Y? in Model 1 be diagonal. Instead, if the variables in 
the score vector x are arranged in groups and the covariance matrix 2 is partitioned 
according to these groups, Y? is block diagonal, a block corresponding to each group 
of variables. McDonald also suggests that the covariance matrices contained in the 
diagonal of Y? also be factored, resulting in battery-specific factors as well as in 
interbattery factors. Three alternative models are considered, the most realistic of 
which is probably a “group canonical factor” model which generalizes the canonical 
factor model. An alternative treatment of groups of variables has been presented by 
Jöreskog (265, 267) for multitrait-multimethod data (82) and by Jöreskog (263) and 
Werts, Jöreskog & Linn (490) for multitest-multioccasion data using the restricted 
factor (RFA) model. Jöreskog presents a number of alternative models, each of 
which represents a special case of Model 2. 

In spite of the advances made in statistical estimation, approximate procedures 
for estimating the parameters of the UFA model continue to be used. These methods 
will probably remain important in exploratory work and in obtaining start values 
for input to iterative estimation programs in fitting restricted factor models. Among 
approximate procedures, that of calculating principal components of a sample corre- 
lation matrix, retaining characteristic vectors associated with characteristic roots 
greater than one, followed by normal varimax rotation still appears to be the most 
popular single method of factor analysis. Kaiser (273, 276) has concluded that this 
method needs revision and has presented a revised procedure which modifies this 
procedure in a number of ways. The new procedure results in correlated factors. 
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A number of investigators have conducted Monte Carlo studies to explore possi- 
ble capitalization on chance in unrestricted factor analysis and analytic factor 
rotation. Tobias & Carlson (473) applied Bartlett’s test of sphericity to a random 
correlation matrix and suggested that this test be applied before any further struc- 
tural analysis of a correlation matrix is undertaken. Cliff (91) reported a Monte 
Carlo study of the sampling characteristics of the characteristic vectors of a correla- 
tion matrix with its diagonal elements replaced with the squared multiple correla- 
tions of each variable with all the remaining variables. Humphreys & Ugen (242) 
studied three approximate methods with real data, making independent parallel 
analyses of random normal deviates, and compared the characteristic roots of the 
real and random correlation matrices to attempt to determine the number of statisti- 
cally reliable factors. Resulting characteristic roots were also compared to the 
likelihood ratio chi-squares obtained in fitting a UFA model with number of factors 
equal to the ordinal number of the characteristic root. In another study, Humphreys 
et al (243) were concerned with possible capitalization on chance in factor rotation 
—either to a target matrix or using an analytic criterion. Correlation matrices 
generated from distributions of random normal deviates were analyzed using the 
iterated principal factor method. The authors concluded that seemingly meaningful 
rotated factors can be obtained from random data using either type of rotation. It 
may be that the only satisfactory solution to this problem is to approach the problem 
of analysis of covariance structures as a problem of comparative model fitting using 
restricted factor analysis (RFA) classificatory models. 


Classificatory models Problems of classification are of two sorts. One problem 
involves classifying a set of p statistical variates into a set of distinct or mutually 
overlapping classes based on the intercovariances of the variates. Examples of 
statistical models corresponding to this problem are the linear structural Models 1 
and 2 with the restriction that some of the elements of A and B are constrained to 
be zero. If the elements of A (say) in Model 1 are all zeros and ones, the model 
corresponds to a classification of variables in which a variable is either a member 
of a class or not a member [variance and covariance component models (57, 60, 260, 
266, 267)]. If values of A not specified to be zero are free to vary (are estimated), 
then the model corresponds to a classification which allows for degrees of member- 
ship of a variate in a category (RFA models). A second problem involves classifying 
a sample of n + 1 objects (individuals) into object-classes based on a set of p 
measurements obtained on the objects. The statistical model associated with the 
problem of object classification has been described as a problem of unmixing a 
mixture of multivariate distributions by Wolfe (500), who has developed a computer 
program based on ML theory for unmixing mixtures of multivariate normal or 
multivariate Bernoulli distributions. In general, the problem of classifying variates 
involves the pairwise statistical dependence of the variates; the problem of object 
classification involves interobject distances and the statistical dependence of the 
variates (65). Fleiss & Zubin (150) have pointed out that these two distinct problems 
often have been confused and argue persuasively against the practice of correlating 
individuals as a basis for classifying them. In this section, we will consider the 
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problem of classifying variates. Certain heuristic clustering procedures which were 
developed for object classification but which are applicable to the problem of cluster- 
ing variates, will also be reviewed here." 

Two types of classificatory models have been considered: single order, in which 
the variables are classified at only one level, and hierarchical, in which the clusters 
of variates at one level are themselves clustered at the next level. A complete 
hierarchical clustering (214, 256) consists of a series of levels, ranging from one in 
which each variable is represented as a separate cluster to one in which all p objects 
are grouped together as a single cluster. Single level classificatory models can be 
represented as special cases of Model | by fixing appropriate elements of A equal 
to zero. Two-level (second order) models can be fit using Model 2. Hierarchical 
models requiring more than two levels require still more elaborate structural models 
and thus have been implemented only as heuristic clustering procedures. 

A number of special cases of Model 2 in which elements in A or B are fixed equal 
to zero are of general interest (cf. 260, 266, 267 for a summary of models which may 
be represented as special cases of Model 2). Consider first models for the analysis 
of sets of congeneric tests (265). A model for a single set of congeneric tests already 
has been described. This model may be generalized to several sets of congeneric tests 
by setting up a column of A for each true score containing zeros for each test not 
in the set measuring that true score. In the case in which all tests in each set are 
parallel, this model leads to a model of compound symmetry (480) which requires 
symmetry within each set of variables (equal variances and covariances) and with 
respect to variables in all sets. Under either model, the intercorrelations of the true 
scores can be estimated. A procedure for testing the hypothesis that the correlation 
coefficient corrected for attenuation between two tests (i.e. the correlation between 
their true scores) is equal to unity is obtained by specializing the model for sets of 
congeneric tests to the case of two sets each consisting of two tests. In this case, there 
is a single correlation & in ® (under Model 1) to be estimated. A likelihood ratio 
test is obtained by taking the ratio of the likelihood when 4), is restricted to unity, 
to the likelihood when du: is estimated. The test can also be made under the more 
restrictive assumptions of parallel tests. See (324), (152), and (299) for other recent 
work on this problem. If Model 2 is used, and I = A®A' + Y? contains the 
intercorrelations of the true scores obtained from fitting the model for sets of 
congeneric tests, then the factor structure of the true scores can be studied by 

°In addition to Wolfe’s (500) paper and the heuristic clustering procedures reviewed in this 
section, a number of other approaches have been made to the problem of object classification. 
For example, Friedman & Rubin (157) attempted to obtain that partition of a set of objects 
into / groups which maximizes the sum of squared Mahalanobis’ distances between the groups 
(i.e. the multivariate F statistic for that grouping). Borucki, Card & Lyle (65), attempting to 
provide a means of distinguishing between predictive clusters and clusters which may have 
arisen due to chance fluctuations in the data, presented a method which first clusters the 
observations and computes spatial boundaries for the clusters and then classifies the clusters 
into groups that represent statistical dependence between two or more variates, groups that 
represent the coincidence of peaks in the marginal distributions, and those that represent 
Statistica] fluctuation in the data. 
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specifying a structure for I’. Such a model represents a special case of a second order 
restricted factor analysis model in which the first level factors specified by the matrix 
B are nonoverlapping. Jöreskog (265) has also described a model for the use of sets 
of congeneric tests in multitrait-multimethod studies in which a number of different 
traits (true scores) are measured each with a number of different methods or measur- 
ing instruments (multiple indicators). 

Other applications of Model 2 which have been studied include restricted factor 
analysis (RFA) models, both first order (Model 1) and second order (Model 2); 
components of variance models (57, 60, 260, 266, 267); models for multitest-multi- 
occasion correlation matrices (263); models for studying change (490); and classifi- 
catory models with dimensions of quantitative variation (124). An RFA model is 
a factor analysis model in which the factors are specified a priori by the locations 
of the zeros in the factor matrix. Such a model is appropriate for confirmatory 
studies in which hypothetical structure is to be verified against a sample covariance 
matrix. In exploratory studies in which the investigator cannot specify a structural 
hypothesis, an heuristic procedure may be used to initially locate the zeros. In both 
confirmatory and exploratory studies, alternative models may be fit to a covariance 
matrix until one is found which best reproduces the sample covariances. In most 
instances, more than one hypothetical structure will be considered, and often there 
will be a natural order in which alternative hypotheses are considered. For example, 
often one proceeds from models containing fewer parameters to more elaborate 
models containing more parameters to be estimated. Joreskog (263) has suggested 
such a sequence of models for studying the structure of multitest-multioccasion 
correlation matrices such as might be obtained in longitudinal growth studies or 
studies of learning (including a model which contains stochastically ordered vari- 
ates). Werts, Jöreskog & Linn (490) presented a specialization of certain of these 
models to studies of growth in which multiple indicators of initial and final status 
on a growth variable are obtained, and multiple measures are also obtained of one 
(or more) traits which are regarded as determinants of growth. Substantive infer- 
ences about growth may be based on estimates of the parameters of their model, and 
various commonly used formulas for reliability of differences, correlation of true 
initial status with true gain, and correlations of true scores over time can be derived 
by imposing appropriate restrictions on their model. 

Degerman (124) has described an interesting problem in which quantitative di- 
mensions are superimposed on a discrete class structure and observed measures 
reflect mixtures of class and quantitative variation. Degerman described heuristic 
procedures for first clustering the measures into discrete classes and then determin- 
ing within-cluster dimensions of quantitative variation. Gebhardt (172) has treated 
a similar problem in which it is supposed that some of the factors underlying a 
covariance matrix are known, while others are to be investigated using the UFA 
model. Both of these problems involve mixed reduced-rank and classificatory mod- 
els. 

Heuristic procedures generally do not specify a classificatory structure a priori but 
rather seek to classify a set of variates through a series of operations on a similarity 
matrix containing pairwise similarity measures. If variates are to be classified, the 
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similarity matrix is usually a correlation matrix; if objects are to be classified, the 
matrix contains interobject distances. Heuristic clustering schemes may be single 
level (the number of clusters unspecified or specified) or hierarchical. Hierarchical 
procedures may be agglomerative or divisive (148). A clustering may be invariant 
over monotone transformations of the similarities, or may treat similarities as mea- 
sured on an interval or ratio scale. Fisher & van Ness (148) have defined nine 
properties which they feel should be satisfied by any reasonable clustering proce- 
dure, and they have investigated the admissibility of a number of kinds of clustering 
procedures with regard to these properties. Any clustering procedure involves a 
similarity function defined for single objects or variables and for clusters and a 
criterion for merging (or dividing) clusters. Johnson (256) has described two proce- 
dures for hierarchical clustering which differ with respect to how the similarity 
function is defined for clusters. Thus, if x and y are two objects or variables which 
are Clustered at level J but not at level j - 1, and z is any third object not in a cluster 
with x and y, the distance d/x, yj, z of the cluster /x,y/ to z may be taken either 
as the minimum of d(x,y) or d(y,z), or the maximum. The rule for merging clusters 
at level j is: search the matrix of similarities at level j - 1, find the minimal nonzero 
similarity, and merge the pair of objects (variables, clusters) with this similarity. A 
frequently used third way of defining the similarity function (cf. 286, 326, 474) is 
to define similarity for clusters in terms of cluster centroids. Hierarchical clustering 
procedures have been described in a number of papers (214, 215, 238, 239, 256, 286, 
350, 442). A single level clustering can always be obtained from a hierarchical 
procedure by stopping when the required number of clusters has been obtained. 
Several authors (114, 326, 474) have described single level clustering algorithms and 
a number of papers compare several clustering algorithms (239, 386, 350). While 
most are maximum, minimum, or centroid procedures, some procedures seek to find 
a clustering which minimizes an objective function of the data. For example, King 
(286) described a procedure which minimizes Wilks’ statistic to test the hypothesis 
of independence between two groups. McQuitty’s methods include his “iterative 
intercolumnar correlational analysis” (350). Shafto (442) has explicated this method 
by showing that it achieves a solution by transforming the test vectors toward a 
bipolar, one-factor structure. He offers two alternative methods for extracting a 
single bipolar factor (“linear contrast”) directly from the similarity matrix. 


Ordered variate models Suppose we have a set of p variables and that p - 1 of the 
variables are each derivable from others in the set by linear regression. In particular, 
consider the first order case in which each of the p - 1 variables is derivable by linear 
regression from only one other variable in the set. If the p variables form an ordered 
sequence such that variable j is derivable from variable j - 1 in the sequence (f = 
2,..., DÄ then the p variables are completely ordered by the linear structural model. 
A set of p variables may also be partially ordered by a system of linear structural 
equations which specifies a network of linear dependency relations for the variables. 
In most instances, the order or partial order is given a priori and the statistical 
problem is to determine whether the sample covariance matrix reflects the hypothet- 
ical order or partial order. 
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The most common situation involving a complete order is a continuous state 
discrete parameter stochastic process (in time) or time series, with p measurements 
made sequentially and with a constant interval of time between observations. The 
simplest situation involving observations ordered in time is a pretest-posttest design 
involving only two observations (cf. 490). In an important paper, Cronbach & Furby 
(107) have reviewed previously proposed procedures for measuring change in pre- 
test-posttest situations, offered an alternative procedure for estimating true residual 
gain scores, and then have proceeded to recommend procedures for various research 
problems involving change which do not require estimation of change scores for 
individuals. Gottman, McFall & Barnett (190) have argued that a better research 
Strategy than the pretest-posttest design is a time series design in which successive 
observations are obtained throughout a programmed intervention and in which the 
characteristics of a change process can be assessed. Time series research designs 
permit one to study both the characteristics of a stochastic process in time and to 
detect changes in time-dependent stochastic process. Such changes may be experi- 
mentally induced or they may be spontaneous. Two aspects of a stochastic process 
are of interest in both cases: trends (which may be present in the means; in the 
variances; and in the means, variances, and covariances) and stochastic dependency 
(i.e. autoregression) which may be present in the structure of the p x p matrix of 
serial correlations. With respect to detection of change, Gottman, McFall & Barnett 
(190) considered change in trend using both curve fitting and moving-average 
approaches, and Jones, Crowell & Kapuniai (258) have described a model for 
detecting change from stationarity in multivariate stationary stochastic processes. 
Murray, Wiley & Wolfe (367) and Jöreskog (263) have presented discussions of the 
use of linear structural models in longitudinal developmental research. 

A number of papers have appeared on the subject of first order autoregressive or 
simplex models (262, 364, 365, 431). Jöreskog (262) defined six types of (mean- 
nonstationary) first order autoregressive models, including a covariance-stationary 
model (Wiener simplex) and a covariance-nonstationary model (Markov simplex). 
The first order autoregressive series is 
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where the residuals č; are uncorrelated and ß; is a regression coefficient. In the 
Wiener simplex, 8; is a constant (which may be set equal to one) while in the 
Markov simplex 8; is time dependent. Since error of measurement may be present 
in the p variables, the autoregression holding for the true scores, Jöreskog (196) also 
identified models incorporating an additional error component which he refers to 
as quasi-Wiener and quasi-Markov simplex models. Jöreskog has represented each 
of these models as a special case of Model 2. Thus, ML estimates of all free and 
constrained parameters can be obtained using ACOVS and likelihood ratio chi- 
squares obtained. It is well known that simplex models are likely to be appropriate 
for longitudinal growth or learning data. Mukherjee (364) has derived the ML 
solution for the quasi-Wiener simplex and Schöneman (431) has developed a least- 
squares procedure for both the quasi-Wiener and quasi-Markov simplex models. 
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Groen (196) has discussed the connection between discrete state Markov processes 
considered in mathematical learning theory and the Markov simplex. 

Several investigators have worked with more general autoregressive models. 
Subba Rao (460) has presented weighted least-squares estimation procedures for 
nonstationary time series models of order K. Note that autoregressive models of 
order k > 2 require a more general structural mode] than Model 2. Gottman, 
McFall & Barnett (190) have argued that a first-order autoregression is likely to be 
sufficient for most applications in psychology. Jones, Crowell & Kapuniai (258) 
considered multivariate time series models and presented a model for detecting 
change in a multivariate time series in which repeated observations are made on 
r variables (an r-dimensional multivariate stochastic process). Their procedure 
involves fitting a covariance-stationary k-order autoregressive model to the serial 
covariance matrix based on observations made before an intervention, obtaining the 
residuals from predicting observations after the intervention, and then testing the 
residual variances for an increase produced by the intervention. Such a technique 
would appear to be valuable in many psychological applications. Likelihood ratio 
tests of the effect of an intervention can be made for first-order univariate stationary 
processes using Jöreskog’s program if a quasi- Wiener simplex is appropriate and can 
be fit to the serial correlation matrix obtained prior to an intervention, by testing 
the fit of the stationary (quasi-Wiener) model to the serial correlation matrix which 
includes correlations based on observations obtained both before and after the 
intervention. 

Other papers concerned with problems involving time series which may be of 
interest include papers on inverse autocorrelations and their application in fitting 
moving average models (90), linear regression with time-dependent residuals (12, 
171, 467), multiple regression with regression weights which are the realization of 
a stationary stochastic process (78), serial correlation (212, 358, 396), tests for trend 
in dispersion in times series (404), and tests for nonstationarity in time series (397). 

Path models involving networks of linear structural relationships have been em- 
ployed in economics and sociology to attempt to assess causal relationships among 
sets of linearly related random variables (55) and in biology for instrument calibra- 
tion (30, 31) and for certain problems in genetics (318). Goldberger (183) has written 
a very interesting paper on similarities of linear structural models used in economics 
and psychology, and Werts & Linn (492) have discussed psychological applications 
of path models. Many path models can be represented as specifications of the general 
Model 2, and examples of such models have been presented by Joreskog (260, 266, 
267). However, since many problems exist for which a more general model is 
required, Jöreskog (261) has presented a still more general model which is similar 
to that used in economics (55, 183, 280). Jöreskog’s model is different from that used 
in economics in that it represents a system of linear structural relations on the true 
scores and thus allows for errors of measurement as well as errors in equations. 
Jöreskog & van Thillo (271) have written an ML estimation program for this general 
model. Their computer program is still more general in that it allows for multiple 
indicators for the true scores. It should be noted that a simplex model is a special 
case of a path model in which the variables form an ordered sequence. Jöreskog 
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(267) has presented numerous examples to illustrate how path diagrams may be used 
to represent any linear structural model in which variables are classified or ordered. 
Since the determination of identifiability of parameters may often be an algebraicly 
complex problem, Werts, Jöreskog & Linn (489) have demonstrated how a visual 
inspection of the path diagram for a model can be used to simplify the identification 
problem. 

The model of Jöreskog & van Thillo is an example of a structural model which 
is both classificatory and ordered. Mixed classificatory-ordered variate structural 
models have also been suggested for multitest-multioccasion correlation matrices 
(263), for dynamic structural analyses which emphasize changes in factor structure 
(423), for analysis of lagged multiple time series (367), and for the study of alterna- 
tive sources of observed stochastic growth in several correlated measurements (154). 
The problem investigated in this latter paper involves seeking to determine by 
comparative model fitting whether observed growth in one measure is due to growth 
in other variables with which the measure is correlated. 


CANONICAL CORRELATION There have been a number of suggestions for general- 
izing canonical correlation analysis to three or more sets of variables (84, 234, 235, 
340, 345). The problem is to identify canonical variates which isolate and summarize 
linear relationships among several sets of variables and to measure the degree of 
relationship. It is also reasonable to require of any procedure that it reduce to 
Hotelling’s classical procedure when the number of sets of variables 1s two. Ketten- 
ring (285) has presented a comprehensive treatment of the canonical analysis of 
several sets of variables, and in addition to formulating earlier procedures in terms 
of random variables, he has provided a treatment which is general and within which 
he has introduced two new procedures. Suppose we are given m sets of variables 
1X,.+-) mX With p, variables in each set. At each of several stages s, we want to derive 
from the variables a set of m standardized canonical variates 


KY = GZ, ..., 20) 


which are linear composites of ,x, . . ., mX, respectively, and such that del maximizes 
or minimizes a particular function of its correlation matrix zl At each stage after 
the first, new sets of canonical variates must satisfy additional restrictions. For 
example, successive variates may be required to be uncorrelated within sets. At the 
first stage, the five criteria identified by Kettenring are: 1. SUMCOR: maximize the 
sum of elements of 6,(1'61), 2, MAXVAR: maximize the largest eigenvalue of 
GU), AW): 3. SSQCOR: maximize the sum of squares of the elements of $ (the sum 
of squares of its eigenvalues AP? +... + Am?) 4. MINVAR: minimize the 
smallest eigenvalue of 6"), A ®; and 5. GENVAR: minimize the generalized vari- 


ance of 2, 
em = TAM, 
i=’ 
Criteria 1 and 2 are due to Horst (234), 3 and 4 are introduced by Kettenring. 
Criteria 1, 3, and 5 require iterative computations. All procedures generate canoni- 


cal variates which make 6 as far as possible from an identity matrix. Each of these 
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procedures reduces to the classical procedure for m = 2. In his paper, Kettenring 
first presents a development for each of these criteria. He then treats the problem 
of choosing restrictions for higher stage canonical variates and presents computa- 
tional procedures for each of the five criteria. The paper includes a comparison of 
results obtained in applying the five procedures to real data. 

Following Rozeboom (422) and Stewart & Love (459), Miller & Farr (354) have 
suggested a measure of interbattery association for two sets of variables which can 
be obtained in the absence of a canonical correlation analysis. The measure of 
association between sets of orthogonalized measures is the proportion of total vari- 
ance in a battery explained through each of its components (any orthogonal linear 
composites such as principal components or canonical variates) by the other battery. 
The association measure developed is directional and asymmetric and is indepen- 
dent of methods employed to orthogonalize the sets of variables. 


LINEAR COMPOSITES A great many problems in multivariate analysis involve seek- 
ing a weighted linear combination of a set of variates such that the linear composite 
has some optimal property or properties. McDonald (340) has presented a unified 
treatment of weighting problems which define best weights according to the follow- 
ing common property: choose a normalized vector of weights associated with one 
or more extreme values of a quadratic or bilinear form whose matrix consists of 
covariances or correlations. McDonald has shown in his paper that eight different 
weighting procedures can be treated as special cases of a single, reasonable general 
weighting procedure corresponding to this property. The eight weighting problems 
considered are multiple regression, canonical correlation analysis, principal compo- 
nents of a correlation matrix, principal components of scale analysis, maximizing 
the correlation between two multicategory items, maximizing composite reliability, 
and canonical factor analysis. 

Another linear weighting problem involves finding least-squares linear composites 
of predictors for estimating several criteria which satisfy the restriction that the 
composites have an arbitrary specified correlation matrix. The optimal weights 
without any restriction on the correlations of the composites are the multiple 
regression weights for predicting each criterion variable. Green (191) has presented 
a solution to this problem which is more general than previous treatments of the 
problem and whose derivation depends on a general result concerning best-fitting 
orthonormal transformations. The procedure is applicable to the problem of obtain- 
ing regression estimates of factor scores whose correlations are the same as the factor 
correlation estimates obtained in fitting the factor model. 


ESTIMATION: PARAMETERS OF CONTINOUS MULTIVARIATE DISTRIBUTIONS We 
consider first bivariate distributions. Several papers have appeared concerning 
effects of selection and restriction of range on the product moment correlation 
coefficient for a bivariate normal distribution. Pearson’s formula for correcting for 
univariate selection is based on the assumption of normality of variates before and 
after selection. Brewer & Hills (74) examined empirically the influence of skew upon 
estimated correlation using Pearson’s adjustment for univariate selection. They 
generated symmetric bivariate distributions and then introduced different levels of 
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correlation and skewness in the predictor marginal distribution. The corrected 
correlation was computed for each cumulative tenth of cases from each direction 
of the predictor variable and in this manner effects of size of correlation, degree of 
skew, and degree of restriction were studied. Brewer & Hills concluded that “even 
with essentially symmetric distributions, a large proportion of the data is necessary 
to obtain reasonably precise estimates of low correlations. With increasing skew, 
estimates become increasingly erroneous, the direction of the error depending upon 
which tail of the distribution is the basis of the estimates.” Effects of truncating a 
bivariate normal distribution on sample variances have been studied by Rao (405). 
Bryant & Gokhale (76) presented a formula for correcting a correlation for restric- 
tion in range for the special case in which the variances of the two variables in the 
unrestricted sample are known. Their formula is appropriate in situations in which 
the selection is on some unknown variable and assumes that the distribution after 
selection is bivariate normal. Several other papers on correction for restriction of 
range have appeared (151, 314, 488). 

Fhaner (144) applied an information measure in an empirical study of informa- 
tion loss when Pearson r is applied to non-normal data. Elston & Stewart (140) 
proposed a new test statistic. for detecting association between two continuous 
random variables. Connor & Mosimann (98) have considered concepts of indepen- 
dence for non-negative continuous random variables which sum to one. For data 
which consist of proportion scores for individual subjects, their treatment may be 
appropriate. A concept of neutrality is developed and related to a concept of inde- 
pendence for proportions, 

A frequently encountered problem in multivariate analysis is that of obtaining 
sample estimates of the parameters of a multivariate normal population when the 
sample data are incomplete. One approach to the missing data problem is to estimate 
values for the missing data points and then compute sample statistics based on both 
observed and estimated values. Timm (471) has published an empirical comparison 
using simulated data of four such methods of estimating correlation and covariance 
matrices when observations are missing at random from the data matrices. The 
effects of sample size, number of variables, percent of missing data, and average 
intercorrelation of variables were examined for the four methods. Methods investi- 
gated were: (2) a method involving decomposing a score matrix into its known and 
unknown parts and using principal components of the known part to estimate the 
unknown part; (6) substituting the sample mean of each unknown variable (based 
on known data) for unknown values; (c) multiple regression estimates; and (d) using 
only the complete data. While no uniformly best technique was found, methods 
a and c were found to be superior independently of sample size and number of 
variates when 1%, 10% and 20% of the data were missing. Games (165) has 
described an efficient computer algorithm for method d 

A second approach to the missing data problem involves attempting to obtain ML 
esimates of parameters under normality assumptions, based on the available obser- 
vations. Individual missing data points are not estimated. One problem with these 
procedures is that they can yield estimates of Z which are not positive definite (i.e. 
for which all characteristic roots are not greater than zero). Hartley & Hocking 


STATISTICS AND DATA ANALYSIS 481 


(216) have attempted to provide a taxonomy for incomplete data problems and to 
develop a unified method of analysis based on ML estimation. An earlier paper 
(Hocking & Smith 227) presented a method for estimating the parameters in a 
multinormal distribution in which missing observations are not restricted to follow 
specific systematic patterns. Morrison (361) has presented a procedure for a single 
incomplete and q complete variates. 


HYPOTHESIS TESTING We will consider first hypotheses on correlation coefficients. 
When two correlation coefficients are calculated from a single sample, they are not 
statistically independent and the usual methods for testing equality do not apply. 
Dunn & Clark (135) have investigated power curves for several large sample tests 
based on Fisher’s z-transformation and have also investigated possible alternative 
simple methods suitable for small samples using Monte Carlo methods to determine 
level of significance and power. Choi (86) has presented a sequential test for correla- 
tion coefficients. The sequential test permits a substantial reduction in average 
sample size compared to that required in a fixed sample test. Jennrich (248) has 
described an asymptotic chi-square test for the equality of two correlation matrices. 

In testing for a treatment effect, an experimenter may have to use the same 
subjects for control and experimental groups. Independence no longer holds, and 
tests of bivariate symmetry may be appropriate. Bell & Doksum (43) have shown 
that bivariate symmetry is not equivalent to univariate symmetry, and they dis- 
cussed alternative tests for this hypothesis. Repeated measures analysis of variance 
models require that the covariance matrix of the repeated observations have com- 
pound symmetry (480). In the case in which constant correlation can be assessed, 
it may be desired to test the hypothesis of homogeneity of variance for the correlated 
variables. Han (207) has proposed four test procedures for this hypothesis and 
compared the four tests. A number of papers have dealt with problems involved in 
testing the assumption of multivariate normality and proposed tests of this hypothe- 
sis (8, 323, 335, 481). 


DISTRIBUTION THEORY This section will identify papers of two sorts: those that 
consider the distribution of statistics which arise in testing correlations or various 
multivariate hypotheses, and papers on generating multivariate data. Kristof (297) 
has discussed properties of a statistic arising in testing correlation and applied the 
statistic to the psychometric problem of testing point hypotheses on coefficient a 
based on dividing a test into two parts. Papers on the distribution of various 
multivariate test statistics include: multiple correlation (198); likelihood ratio crite- 
ria for tests of independence of sets of variates (101), compound symmetry (103), 
bipolarity of a covariance matrix (102), and sphericity in a p-variate normal distri- 
bution (100, 254); largest characteristic roots of covariance and other symmetric 
matrices (221, 462); traces of Wishart matrices (249); and the ratio of two general- 
ized variances (24). Capra & Elster (83) discussed a procedure for generating N 
multinormal observations on p-variates where the p-variates have specified means, 
variances, and covariances; and Smith & Hocking (451) have described a subroutine 
for generating a covariance matrix from a Wishart distribution. 
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Ordinal Data 


STRUCTURAL MODELS Structural models for ordinal data have been developed 
both in the context of factor analysis (cf. 201, 202, 319, 320) and multidimensional 
scaling [cf. the reviews of Zinnes (506) and Cliff (92), the recent book edited by 
Shepard, Romney & Nerlove (444), and the bibliography prepared by Harris (213)]. 
Ordinal (“‘nonmetric’’) structural models for matrices of ordinal similarities may be 
rank reducing (Spatial) or classificatory. Rank reducing models seek to generate a 
configuration of points in a Euclidean space of smallest (or specified) dimensionality 
which is capable of reproducing the rank-order of the similarities of the objects so 
represented. Classificatory models seek to generate an object classification from 
which ordinal similarities can be reproduced. As yet only heuristic estimation 
procedures have been produced for structural anaiysis of ordinal data (Kruskal & 
Carroll 300). Work on classificatory models for ordinal similarities has already been 
reviewed. Research on rank reducing models is regularly reviewed in the Annual 
Review chapter on scaling and will not be covered here (92, 139, 506). 


CONTINGENCY TABLES WITH ORDERED CATEGORIES Frequently it is necessary to 
measure association between variables that are rank orderings in which many ties 
occur among the rankings. Such data are often presented in a cross-tabulation. It 
may be of interest to extend commonly used measures of association to multiway 
cross classifications and to develop measures analogous to partial and multiple 
correlation coefficients for such tables. Davis (122) has defined a partial coefficient 
Yanc in a manner analogous to Goodman & Kruskal’s interpretation of their coeff- 
cient Y. They define Y,45c as “how much more probable it is to get like than unlike 
orders in measures A and B when pairs of individuals differing on A and on B and 
tied on C but unselected on any other measure are chosen at random from the 
population” (p. 190). They show that this coefficient is a weighted sum of the values 
of Y in the various strata defined by the categories of C. Somers (453), seeking to 
provide a measure of partial association in three-way tables which takes account of 
the ordered nature of the control variable, has presented a coefficient based on 
Kendall’s partial coefficient but modified to distinguish the independent from the 
dependent variable. 

Two very general models for the analysis of contingency tables with ordered 
categories have been presented by Williams & Grizzle (498). Their methods apply 
a recently developed linear models approach to the analysis of interaction in contin- 
gency tables, and the analysis can be performed using the weighted regression 
methods of Grizzle, Starmer & Koch (194). The models permit investigation of 
experimental design variables with category effects and utilize information con- 
tained in the ordered responses. These methods will be described in the next section 
on discrete (categorical) data. Finally, Weisberg (487) has developed an approach 
to the problem of comparing two multinomial distributions with E categories in 
which the categories are ordered. 


Categorical Data 


ADDITIVE COMPONENT STRUCTURAL MODELS A major development during the 
period reviewed is a general model for the analysis of categorical data arranged in 
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multidimensional contingency tables. The approach taken in this work is to trans- 
form observed proportions contained in the cells of a multidimensional contingency 
table by means of a specified response function which may be linear, log-linear, or 
logistic, and to represent the transformed data by means of an additive component 
structural model analogous to that used in analysis of variance. Thus a design matrix 
can be specified to estimate any effects which are of interest and the effects then may 
be polled for hypothesis testing. Hypotheses that can be considered may involve the 
analysis of main effects or interaction effects in tables based on multiple independent 
samples, they may involve tests of conditional independence in multivariate tables 
based on a single sample, or they may involved both independence and interaction 
effects. Since the statistical treatment of these hypotheses is identical, work on the 
application of the linear models approach to the study of group differences in 
contingency tables will be included in this section. 

The analysis of categorical data by linear models has been approached using 
minimum chi-square (weighted least squares), maximum likelihood, minimum dis- 
crimination information, and minimum Pearson chi-square estimation. We will 
outline first the general model for the analysis of a multidimensional contingency 
table as presented by Grizzle, Starmer & Koch (194) for an s x r table using 
weighted least squares, and then discuss other approaches to estimation. Suppose 
we have an s x r contingency table with observed proportions Do which estimate 
the probabilities 7. Let the rows represent different populations and the columns 
represent different categories of response. Let 


T'i = (Tin 3 Tir), 
the sth row of the expected contingency table, and 

p'; TE (Pin KSE? Pir) 

the corresponding sample estimate. Let 

mw’ == (m'n... 0, 
contain the entire table “strung out” into an rs element vector. Then var (p,) = V 
(ëch e a function of ge (for all cells), and the corresponding sample estimate 
V(p,) can be computed. Let V(p) be a block diagonal matrix having V(p,) on the 
main diagonal. Then arbitrary response functions of the elements of 7, fn (m) may 
be defined (m = J, ..., u) such that each function has partial derivatives up to 


second order with respect to the elements of m, and the functions are jointly 
independent. Let f (p) be the mth function evaluated at 


mT = p, and let F' = (f,(p), DEE f,,(p)) 


which estimates F(7r)'. The covariance matrix of F is S = HV(p)H' of rank u where 


uxrs\ dll, 


(284, p. 232). When £,,(p) is a linear function of the elements of p, S is the exact 
covariance matrix of F; otherwise it is the asymptotic covariance matrix. 
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Now assume the linear structural model F(a) = Xß where X(u x v) is a known 
design matrix of rank y < u and £ is a vector of v unknown parameters. A weighted 
least-squares estimate b of £ is obtained by finding that vector b which minimizes 


(P - Xb)’ SF - Xb). 


From here on the treatment is conventional weighted least-squares regression, and 
hypotheses may be tested in the same manner as in the linear models approach to 
analysis of variance if the model fits. All test statistics have asymptotic chi-square 
distributions with appropriate degrees of freedom. Particular models may be speci- 
fied by choosing the response functions F and specifying a design matrix X. Chi- 
squares are additive and may be pooled to test specific hypotheses. 

The maximum likelihood (ML) approach begins with an assumed sampling distri- 
bution for the sample observations. Three sampling schemes lead to identical unique 
ML estimates for complete or incomplete tables [if the zero cells are sampling zeros 
rather than structural, Le a priori zeros, (146)]; independent Poisson variates for 
each cell, a single multinomial sampling scheme, and a set of independent mul- 
tinomial sampling schemes. All three sampling models lead to the log likelihood- 
ratio statistic L = 2% ny In (n,/m,) = 2nd py In (p/m) where nj is a cell 
frequency, my is an ML estimate of po n = Ln j and the summations are over 
all the cells in the table. L has an asymptotic chi-square distribution with degrees 
of freedom obtained in the usual way. If it is desired to transform the proportions 
and assume a linear structural model for the transformed proportions, the likelihood 
function can be written as a function of the parameters of the linear model. The most 
frequent assumption is the log-linear model: Inm;= u + Up + Yay + Gsm for 
each cell in the table. ML estimates of the parameters of this model are thus 
obtained, and the ML chi-squares for different hypotheses are additive. Fienberg 
(145) has described an iterative procedure for obtaining ML estimates for this 
model. The minimum discrimination information approach of Ku & Kullback (301) 
leads to estimates that are identical to the ML estimates. Statistical properties of the 
different estimates have been thoroughly discussed by Grizzle, Starmer & Koch 
(194), Fienberg (146), Bishop (53), and Goodman (188). 

It remains to identify particular response functions Az) which have been used 
and to indicate specific hypotheses which can be tested using each function. Four 
classes of response function can be defined which cover all the applications we have 
found in our review. The classes of response functions are as follows: 1. linear 
functions: F(7r) = Az, where A is a u x rs matrix of arbitrary specified constants 
(cf. 194, 195, 293). 2. Logarithmic-linear functions: F(a) = K In Ar, where A is 
as given under 1, K is a fx u matrix of arbitrary constants, and F(r) is a £ x /column 
vector (53, 54, 146, 186-189, 194, 195, 293). Note that the logistic response function 
logit py = In(p,/( - py)) is a special case of 2 (47, 48, 53, 194, 498). 3. Exponential- 
logarithmic linear functions: F(a) = exp[K In Ar] (293). 4. Inverse normal ogive 
(normit) response function: F(r) = ®!(r), where Gitar Ais the unit normal deviate 
corresponding to the cumulative normal probability oo This latter function is used 
principally in bioassay and in item analysis. It is well known that the logit and 
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normit functions yield values that are practically equivalent except for extremely 
small or large probabilities. 

The references cited after the definitions of each response function above contain 
numerous examples of analyses applying linear models to transformed proportions. 
Several classes of hypotheses may be identified. First consider categorical data 
collected on a single population tabulated in a multiway contingency table to study 
the associations among the attributes associated with the ways of classification. 
Hypotheses that can be considered in this case using response function 1 are 
homogeneity of marginal distributions (194) and symmetry and marginal (condi- 
tional) homogeneity (244). Hypotheses using response function 2 are independence 
(195), quasi-independence both in tables containing structural zero frequencies (146, 
186) and in tables with incomplete response patterns (195, 293), and partial or 
conditional independence (195). When categorical data are collected on several 
populations (groups of individuals, treatment groups), the linear model analysis is 
very similar to analysis of variance for continuous data. Hypotheses involving 
several populations can be studied using any of the above response functions. 
Hypotheses include main effects, interactions (53, 194, 293), and hypotheses involv- 
ing ordered response classes (498). The advances represented by this work make it 
possible now to perform the same kind of detailed analyses on qualitative data that 
have long been possible for quantitative data. 


LINEAR STRUCTURAL MODELS Consider a set of binary classifications on p dis- 
crete random variables (or items). It may be of interest not only to investigate 
hypotheses regarding the independence of the variates arranged in a p-way contin- 
gency table but also to study the dependence structure of the variates in a manner 
analagous to that employed in the study of structural relations of p continuous 
random variables. The approach taken to this problem is similar to that developed 
for the analysis of multidimensional contingency tables using additive component 
models. The items are assumed to represent dichotomous (or polychotomous) ran- 
dom variables that are derived from underlying continuous random variables by a 
process such as that for item 4 p;= Prob (y,>.A,) for some cutting point 4;. A linear 
structural model can then be assumed for the p element vector y. If y is assumed 
to be p-variate multinormal, the cases of general Model 2 can be considered. 
Response functions that have been considered for the proportions p; are the linear 
(312, 341), the logistic (47, 52), and the normal ogive function (21, 58, 59, 61, 87). 
The most frequently assumed structural model that has been considered is the 
one-factor (rank one) model. Bock & Wood (61) have recently reviewed this work 
for the 1971 Annual Review of Psychology in their chapter on test theory. An 
interesting application of the one-factor model in bioassay is provided by Ashford 
& Sowden’s (21) paper on multivariate probit analysis. If the general Model 2 (or 
still more general models) is to be applied to the problem of analyzing dichotomous 
variables, one can adopt the normal ogive response function and assume y is 
p-variate multinormal with covariance matrix satisfying Model 2. Christofferson 
(87) has recently attempted to develop an estimation procedure for the RFA model. 
Since ML estimation of parameters will lead to large amounts of computation and 
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therefore severely limit the number of items that can be analyzed, Christofferson 
suggested estimation procedures based on the marginal distributions for single and 
pairs of items. Two procedures are developed: a generalized least-squares procedure 
and a simple two step least-squares procedure. The former estimators are asymptoti- 
cally efficient among estimators that use the same amount of information and the 
function minimized is asymptotically chi-square. Unfortunately, the method re- 
quires an extremely large amount of computer storage. The estimation problem 
associated with fitting linear structural models to dichotomous (or polychotomous) 
variables is in much need of further development. 


TETRACHORIC CORRELATION Bock & Lieberman (60) found that they were able 
to get heuristic estimates of the parameters of the one-factor normal ogive model 
by fitting a one-factor model to the matrix of inter-item tetrachoric correlations 
using Joreskog’s program, provided the tetrachoric correlation coefficients were 
computed to a high degree of accuracy. They obtained accurate estimates (to nearly 
four figures) by interpolating in a table of the bivariate normal distribution. In the 
interest of finding a fast and accurate method, Froemel (158) compared several 
existing approaches using tabled probabilities for the bivariate normal distribution 
to generate data. A computer routine by Saunders was identified as most accurate 
for the widest range of data. Other papers on tetrachoric correlation are (113, 185, 
204, 205). 


DISCRETE STRUCTURAL MODELS Murray (366) has recently presented a general 
model for fitting discrete structural models to an m-way contingency table in which 
each classification corresponds to response categories for one of m items. Let 
Pa» a,’ VO aan 
be the observed proportion for the a,, 42, ..., Sa cell (a; = 1,..., N; response 
categories for the sth item), and let 
Tap "rr OD 
denote a latent (theoretical) probability parameter for the a, ..., Gah cell. The 
latent probabilities are assumed to be specified functions of some smaller set of basic 
latent parameters H. Let 


Ore. 
t t 

denote a misclassification probability for the sth item; i.e. the probability that a 
member of the oa latent state (category) is observed in category a,. Let p be a vector 
containing the observed proportions, 7r a vector containing the corresponding latent 
probabilities, 

QV = Gia) 
be an (N; x N;) matrix of misclassification probabilities for the sth item, and Q = 


Q'8Q’8 ... @Q™ be an N X N matrix for all latent and observed cells (where @ 
denotes a Kronecker product). Murray has shown that under the assumption that 
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misclassifications are independent from item to item for subjects in the same latent 
state, p = Qr. The procedure is to constrain the latent probabilities m by means 
of an assumed structural model and obtain ML estimates of parameters by maximiz- 
ing the likelihood function subject to the constraints imposed by the model. Models 
considered by Murray are pure Guttman scales, discrete state Markov processes, 
and causal models. Applications of the models to developmental data are presented. 


REGRESSION WITH CATEGORICAL DEPENDENT VARIABLES Neter & Maynes (369) 
have discussed the appropriateness of the correlation coefficient with dichotomous 
dependent variables in an interesting paper which focuses on problems relating to 
curvilinearity and the nature of the prediction being made. They concluded that the 
correlation ratio is likely to be a better measure than the coefficient of determination 
because it is free of restrictions on the functional form of the relationship. Alterna- 
tive measures are also considered. Morrison (362) has considered the problem of 
evaluating prediction models that generate probabilities for binary outcomes. Morri- 
son suggests that a correlation between the binary outcomes and the prediction 
probabilities can yield meaningful results. Finally, Stanley (457) has discussed a 
similarity between biserial r and Glass’s (178) biserial r for ranks. 


MULTIPLE DEPENDENT VARIABLES, MULTIPLE 
POPULATIONS 


Continuous Interval Data 


The developments to be described in this section almost all assume multivariate 
normality and consist of extensions of multivariate analysis of variance techniques 
and discriminant analysis. This seems to be an appropriate place to mention that 
a number of new and useful books on multivariate analysis have appeared in recent 
years. We single out Cooley & Lohnes (104), Dempster (126), Morrison (360), and 
Tatsuoka (465) as especially worthy. 

Several review articles have appeared recently that the student or novice user of 
multivariate technique should find illuminating. The articles by Pruzek (398), Bay 
(35), Finn (147), and Porebski (388) illustrate various multivariate methodologies 
for studying group differences within the context of multivariate analysis of variance 
and discriminant analysis. A significant review of recent work at a more technical 
level is provided by Rao. (408) Joe & Anderson (251) have developed a series of 
potentially useful measures of association between treatments and effects in mul- 
tivariate ANOVA correspondence to the various test statistics (e.g. likelihood-ratio, 
Hotelling’s trace) commonly used in these setups. Another review paper worth 
careful perusal is Kempthorne’s (282), in which he raises questions about the utility 
and purposes of multivariate analysis generally. See also the papers by Healy (222) 
and Hummel (241). 


SIMULTANEOUS TEST PROCEDURES Perhaps the most exciting and potentially use- 
ful recent work in multivariate ANOVA is the work on simultaneous test procedure. 
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Roughly speaking, simultaneous test procedures provide post hoc comparisons 
given that a significant overall result has been observed. In the multivariate case, 
this may involve studying particular variables and subsets of variables or linear 
composites of variables as well as comparing pairs of treatment groups and other 
contrasts on treatment groups. Thus these procedures permit some resolution of 
overall significant results down to single variables and contrasts. The test procedures 
are based on the eigenvalues of the product of the hypothesis and within-groups 
dispersion matrices. Gabriel (160-162) discusses test procedures based on all the 
usual multivariate significance tests (e.g. largest root, etc). His papers are quite 
readable, though technical, and merit extensive scrutiny by psychologists. Gabriel 
recommends the largest root procedures in general but points out that if the experi- 
menter is merely interested in comparing all pairs of groups on all single variables, 
a battery of ordinary t-tests cannot be improved upon. Note, however, that signifi- 
cance levels are not calculated in the ordinary way as for one pair of groups and 
one variable. Other useful papers on simultaneous inference are those of Krishnaiah 
(296) and Jensen (250). 


DATA ANALYSIS Another set of exciting developments in multivariate methodol- 
ogy involve what are becoming known as data-analytic methods. Cooley & Lohnes 
have adopted an explicitly data-analytic outlook in their revision (104) of their 
well-known book. Briefly put, data analysts, as opposed to statisticians, adopt a 
somewhat cavalier attitude toward classical estimations and significance testing and 
are primarily concerned with heuristics and algorithms for data reduction, exposure 
of structure, and exploration of nonlinearities and nonsingularities. Graphical pro- 
cedures play a large role in univariate data analysis and have been adapted success- 
fully to several multivariate situations (179). Gnanadesikan & Wilk have provided 
an excellent overview of the area (180). 


DISCRIMINANT ANALYSIS Discriminant analysis continues to attract attention in 
all its varieties. Two excellent and readable papers by Porebski (387, 388) summa- 
rized current knowledge and illustrated the application of discriminant analysis 
technology. Urbakh (478) showed that the loss of discriminating power due to 
deletion of a single variate is 


A? — B? „/P 


where A? is the Mahalanobis distance, 8, is the standardized discriminant coeff- 
cient, and PX is the diagonal element out of the inverse of the correlation matrix. 
This formula, of course, is directly analogous to the formula for the reduction in 
squared multiple correlation. Han (206, 208) discussed discriminant analysis with 
unequal covariance matrices, and Lachenbruch (305-308) in a series of papers 
explored misclassification probabilities as related to sample size and other factors. 
Geisser (174) introduced some Bayesian notions into discriminant analysis. 


MANOVA In the rest of this section, we briefly review some other recent develop- 
ments related to MANOVA and discriminant analysis. The robustness of multivari- 
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ate techniques with respect to heteroscedasticity and lack of normality has been 
studied by Mardia (336) and Ito (245). Both authors reported that multivariate test 
procedures are robust to non-normality when testing hypotheses about mean vectors 
but not necessarily so when testing hypotheses about variance-covariance matrices. 
Heteroscedasticity can be a severe problem, and Ito (245) developed some alterna- 
tive test statistics for use when marked heteroscedasticity is present. Korin (294), 
however, reported substantial robustness. Holloway (233) examined the robustness 
of Hotelling’s T? statistic. 

Two papers (193) discussed the analysis of growth curves by multivariate tech- 
niques. Grizzle’s (192) paper on this topic also illustrates the use of planned compar- 
isons in MANOVA. Two papers dealt with the problem of missing data in 
discriminant analysis. Jackson (246) and Chan & Dunn (85) found substitution of 
the mean for missing values to be a satisfactory procedure compared with other 
alternatives. 

One-sided multivariate significance tests have been investigated by Kudo (302), 
Kudo & Fujisawa (303), and Perlman (379, 380). Generally speaking, these tests can 
be applied to one-sample cases with hypotheses that the mean vector is zero against 
an alternative that all the components have “slipped to the right.” It appears that 
further work will be necessary before fully usable procedures are available, although 
clearly a significant start has been made. 

A number of papers have dealt with the distribution of test statistics used in 
MANOVA and other types of multivariate analysis. These papers generally are 
quite technical and beyond the reach of a psychological audience. Several, however, 
contain useful results or tables and are cited here (211, 249, 253, 363, 382-384). 

In closing this section, we want to draw special attention to the recent work of 
Jöreskog (264) on simultaneous factor analysis in several populations. This promises 
to be an extraordinarily useful statistical technique with numerous psychological 
applications. Briefly, his method consists of determining maximum likelihood esti- 
mators of factor pattern matrices, factor intercorrelation matrices, and uniquenesses 
under models of increasing degrees of stringency and evaluating the likelihood ratio 
test statistic for each model. 


Ordinal Data 


In this section we are going to deal solely with a new class of nonparametric 
multivariate procedures for testing the hypothesis of equality of location parameters 
for two or more goups. These procedures have been discussed by Puri & Sen (399, 
401), Anderson (11), Bhapkar (49), and Bell & Smith (45). Work is under way to 
extend multivariate nonparametric methods to general linear (MANOVA) models 
and include estimation problems as well (401, 441). 

Consider a p-dimensional random vector with continuous distribution function 
F(X), where the subscript ki = 1, ..., K) indexes various treatments or experi- 
mental groups. We suppose a location shift as a consequence of experimental treat- 
ments, i.e. 
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F(X) = F(X + 8.) 


where ©, is a vector of constants. Otherwise the multivariate distributions are 
identical. We desire to test the null hypotheses 


H. 8, Kaes O 
for all k versus 
H, e 9, ze O. 


Suppose Dr observations have been made on the random vector x under treatment 
condition k. We now rank all 


2k nN) 


observations on each of the p-variables separately and form the ranks into an N X p 
matrix R in which the first n, rows correspond to the sample of observations 
obtained under treatment conditions one, etc. The ranks can be transformed further. 
For example: let 

(i) _ Ji ifr; < (N+ DY 

5 IO otherwise 

(ii) Py = rj/(N + 1) 

(iii) P,=PUr,/(N+D 
These lead to a median test, a rank-sum test, and normal scores test, respectively, 
if $-! denotes the inverse normal transformation. The test'statistic involves calculat- 
ing the mean vector of the P matrix for each subsample and the grand mean vector. 
Denote by T, the mean vector of P corresponding to the Ath sample or treatment, 
by P the grand mean. Further, calculate a variance-covariance matrix V from the 
matrix P, The proposed test statistic is 


L= ZE nT, - PI VOIT, - P). 


This statistic is not fully distribution-free since it depends on the rank correlations 
of the observations, but a distribution-free test can be determined by considering the 
permutation rank order distribution of the statistic. Puri & Sen (399, 401) have 
shown that for large samples, the permutation distribution of L is asymptotically 
chi-square with p(K - 1) degrees of freedom. For small samples, the permutation 
distribution of L would have to be determined for each separate case. 

Modifications of the foregoing procedures have been discussed by Patel & Bradley 
(373). Tamura (464) has developed methods for comparing each treatment with a 
control. Mantel (334) and Anderson (11) have described approaches alternative to 
the permutation rank order procedures that appear very interesting. Most of this 
literature is highly technical. A less technical review paper with applications is 
sorely needed for the applied worker. 
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90-95 
observing behavior, 91- 
93 
preference for signaled 
reinforcement, 93-95 
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326-27 
Figure drawing 
by children, 11 
Flicker 
Fourier analysis in, 207- 
8 
Flutter 
experiments on, 235, 243 
Fourier analysis 
and spatial vision, 207-14, 
216-17, 220 
Freeloading 
animals’ preference for 
work, 95-97 
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organization development, Illusions 
313-41 Delboef 
Guidance Study developmental aspects of, 
results of, 258 10 
Müller-Lyer 
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socioeconomic status com- 
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